- You can get the metadata from somewhere else in csv format.
- You want to experiment on a collection. And you manage metadata in a separate csv file and do not need to reenter all the metadatas after you screw up the collection.
Greenstone has a metadatacsv plugin and unfortunately the author cannot figure out how to use it. Alternatively, Greenstone has a explode metadata mechanism which is very useful. Let us detail it.
Let us first assume we have three files in Greenstone:
a.txt, b.txt, c.html
Filename,dc.Title,dc.Creator,Description,Contributor a.txt, a, GZ, first test, GJZ b.txt, b, GZ, second test, GJZ c.html, c, GZ, third one, GJZ
A couple of point to notice for this file:
- The first line is the label
- The name of the label must exactly match the your metadata name (including the case, title will be mapped into "dc.title", instead of the standard "dc.Title")
- Later we will choose the default metadata set, the non-qualified label will be mapped into this default set (Dublin Core in this case)
- One can use a simple text editor do it.
- Google Doc is also an excellent tool with the built-in collaborating function. One can create a spreadsheet first, fill it with your team mates, then "download as" a csv file.
Here is the procedure in GLI
- (optional) create a new collection
- In the "Gather" tab, add all three files (a.txt, b.txt, c.html)
- In the "Gather" tab, add meta.csv (a window will pops up asking about plugin to use, choose either one and this does not matter much)
- In the "Enrich" tab, right-click "meta.csv", choose "explode metadata database", a new window will pop up (it shows as CSVPlugin, it is fine)
- choose "metadata_set" accordingly (Dublin Core in the example), this will be your default set, all your non-qualified metadata will be mapped into this (in our example, "Description" is mapped as "dc.Description")
- tick "document_field", enter the label of the file name column ("Filename" in the example)
- click "explode"
- In the Enrich tab, you will notice that all three files becomes the sub-levels of meta. And the metadata field is populated. You can enter more
- Build the collection, tweak the display, do the rest
Adding metadata with a new metadata scheme
Sometimes one has some special metadata fields that do not fit any existing schema. One might want to create a new one from scratch or by modifying the existing one. This can be done in Greenstone GEMS. Or one can do the same in GLI "Manage Metadatasets..." in "Enrich" tab.
I made a new metadata set "my new DC" from Dublin Core. So it has no complication of the sublevel, all flat. I specify it with a new "namespace" ---"mdc" in order to distinguish it from normal dc. I then use the same procedure as in the coruse website to explode a csv metadata file and it work fine.
1 comment :
Thank you so very much! Helped a lot. I was struggling with the Excel csv for 6 hours straight now.
Post a Comment