scholarly journals Managing a Mass Digitization Project at Meise Botanic Garden: From Start to Finish

2018 ◽  
Vol 2 ◽  
pp. e25912
Author(s):  
Henry Engledow ◽  
Sofie De Smedt ◽  
Quentin Groom ◽  
Ann Bogaerts ◽  
Piet Stoffelen ◽  
...  

Mass digitization is a large undertaking for a collection. It is disruptive of routine and can challenge long-held practises. Having been through the procedure and survived, we feel we have a lot of experience to share with other institutions who are considering taking on this challenge. The changes that digitization has made to our institution are positive and the digitization a success, but that is not to say that we would not have done some things differently, were we to repeat the exercise. In 2015 Meise Botanic Garden received a grant from the Flemish Government to upgrade its digitization infrastructure and mass digitize 1.2 million specimens from its African and Belgian Herbaria. The new infrastructure improved our workflow significantly, enabling us to digitize specimens five to ten times faster while also improving their quality. The mass digitization part of the project was split into two parts, imaging and transcription. The contract was awarded and out-sourced to Picturae, who started imaging in May 2016 using a conveyor belt installation. Prior to starting, a significant amount of preparation was required at the herbarium. Within one year, 1.2 million specimens were imaged. The images were captured as TIFF files and stored in triplicate at The Flemish Institute for Archiving (VIAA), while smaller derived JPEG 2000 and JPEG files were generated for day-to-day use. The second part of the project was label transcription. A third of the specimens were transcribed in-house for capturing minimal data (barcode, filing name, collector, collector number & country of origin). This was partly done to reduce costs, but also allowed us to compare in-house to out-sourced transcription. Some 500,000 specimens were transcribed, either completely or partially, by Alembo (subcontracted by Picturae).The remaining 200.000 specimens from our Belgian Herbarium are being transcribed using crowdsourcing. The latter is being realized through the citizen science platform DoeDat (www.doedat.be) that was launched in November 2017. Many lessons have been learnt with respect to implementing mass digitization, both practically and sociologically. Many of the problems encountered during the project could have been avoided by changing the workflow. The addition of extra control points during the process could have reduced problems encountered later in the data capture process. Solving these problems at a later stage was time consuming. Trying to “save money” can result in a disruptive workflow, which may lead to a number of costly errors. Mass digitization has fundamentally changed the workflow in our collections and the way in which our herbarium is managed. All images for the African and Belgian collections may be now found on our new virtual herbarium www.botanicalcollections.be.

2018 ◽  
Vol 2 ◽  
pp. e26514 ◽  
Author(s):  
Henry Engledow ◽  
Sofie De Smedt ◽  
Ann Bogaerts ◽  
Quentin Groom

There are many ways to capture data from herbarium specimen labels. Here we compare the results of in-house verses out-sourced data transcription with the aim of evaluating the pros and cons of each approach and guiding future projects that want to do the same. In 2014 Meise Botanic Garden (BR) embarked on a mass digitization project. We digitally imaged of some 1.2 million herbarium specimens from our African and Belgian Herbaria. The minimal data for a third of these images was transcribed in-house, while the remainder was out-sourced to a commercial company. The minimal data comprised the fields: specimen’s herbarium location, barcode, filing name, family, collector, collector number, country code and phytoregion (for the Democratic Republic of Congo, Rwanda & Burundi). The out-sourced data capture consisted of three types: additional label information for central African specimens having minimal data; complete data for the remaining African specimens; and, species filing name information for African and Belgian specimens without minimal data. As part of the preparation for out-sourcing, a strict protocol had to be established as to the criteria for acceptable data quality levels. Also, the creation of several lookup tables for data entry was necessary to improve data quality. During the start-up phase all the data were checked, feedback given, compromises made and the protocol amended. After this phase, an agreed upon subsample was quality controlled. If the error score exceeded the agreed level, the batch was returned for retyping. The data had three quality control checks during the process, by the data capturers, the contractor’s project managers and ourselves. Data quality was analysed and compared in-house versus out-sourced modes of data capture. The error rate by our staff versus the external company was comparable. The types of error that occurred were often linked to the specific field in question. These errors include problems of interpretation, legibility, foreign languages, typographic errors, etc. A significant amount of data cleaning and post-capture processing was required prior to import into our database, despite the data being of good quality according to protocol (error < 1%). By improving the workflow and field definitions a notable improvement could be made in the “data cleaning” phase. The initial motivation for capturing some data in-house was financial. However, after analysis, this may not have been the most cost effective approach. Many lessons have been learned from this first mass digitisation project that will implemented in similar projects in the future.


Author(s):  
Natacha Frachon ◽  
Martin Gardner ◽  
David Rae

Botanic gardens, with their large holdings of living plants collected from around the world, are important guardians of plant biodiversity, but acquiring and curating these genetic resources is enormously expensive. For these reasons it is crucial that botanic gardens document and curate their collections in order to gain the greatest benefit from the plants in their care. Great priority is given to making detailed field notes and the process of documentation is often continued during the plants formative years when being propagated. However, for the large majority of plants this process often stops once the material is planted in its final garden location. The Data Capture Project at the Royal Botanic Garden Edinburgh is an attempt to document specific aspects of the plant collections so that the information captured can be of use to the research community even after the plants have died.


Author(s):  
James Clugston

This paper discusses a curatorial review of the Cycadales in the Living Collection at the Royal Botanic Garden Edinburgh (RBGE) following the methodology described by Frachon et al. (2009). It provides an understanding of the cycads in the Living Collection and the diversity of the group represented at RBGE. The paper also describes the determination process and any changes in identification during the course of the project. This project was carried out as part of the second year project of the HND in Horticulture with Plantsmanship at RBGE.


ECA Sinergia ◽  
2019 ◽  
Vol 10 (2) ◽  
pp. 129
Author(s):  
Ramón Antonio Zambrano Mero ◽  
Grether Lucía Real Pérez ◽  
José Raúl Quimis Reyes ◽  
Argelio Antonio Hidalgo Avila

  Las actividades realizadas en los laboratorios de producción de nauplios y poslarvas son el comienzo de la producción de camarón, donde el control de procesos es esencial para los resultados productivos deseados. La investigación se realizó en un laboratorio de producción la parroquia Canoa, provincia de Manabí, con el objetivo de establecer el control de procesos para incrementar los resultados productivos. Se aplica el método: Análisis de Peligros y Puntos Críticos de Control (APPCC). Los resultados obtenidos demuestran que existen factores que no han sido controlados: la temperatura del agua, la energía eléctrica, la presencia de hongos, bacterias, mohos que son PCC en el proceso; su control puede incrementar la supervivencia en un 20 % de la situación actual. El estudio económico en la propuesta realizada arroja un beneficio/costo de 1.97, una tasa de retorno de la inversión de menos de un año, considerándose aceptable el proyecto.   Palabras clave: Método APPCC, riesgo, producción, productividad, camarón   ABSTRACT The activities carried out in the production laboratories of nauplii and postlarvae are the beginning of the production of shrimp, where the control of processes is essential for the desired productive results. The investigation was carried out in a production laboratory in the Canoa parish, province of Manabí, with the objective of establishing process control to increase the productive results. The method is applied: Hazard Analysis and Critical Control Points (HACCP). The results obtained show that there are factors that have not been controlled: the temperature of the water, the electrical energy, the presence of fungi, bacteria, molds that are PCC in the process; its control can increase survival by 20% of the current situation. The economic study in the proposal produced a benefit / cost of 1.97, a rate of return on investment of less than one year, considering the project acceptable.   Key words: HACCP method, risk, production, productivity, shrimp


Author(s):  
K. Li ◽  
S.J. Li ◽  
Y. Liu ◽  
W. Wang ◽  
C. Wu

At the present, in trend of shifting the old 2D-output oriented survey to a new 3D-output oriented survey based on BIM technology, the corresponding working methods and workflow for data capture, process, representation, etc. have to be changed.Based on case study of two buildings in the Summer Palace of Beijing, and Jiayuguan Pass at the west end of the Great Wall (both World Heritage sites), this paper puts forward a “structure-and-type method” by means of typological method used in archaeology, Revit family system, and the tectonic logic of building to realize a good coordination between understanding of historic buildings and BIM modelling.


2017 ◽  
Vol 1 ◽  
pp. e20194
Author(s):  
Henry Engledow ◽  
Sofie De Smedt ◽  
Quentin Groom ◽  
Ann Bogaerts ◽  
Piet Stoffelen ◽  
...  

2012 ◽  
Vol 6 (1-2) ◽  
pp. 111-119 ◽  
Author(s):  
Elspeth Haston ◽  
Robert Cubey ◽  
David J. Harris

Logistically, the data associated with biological collections can be divided into three main categories for digitisation: i) Label Data: the data appearing on the specimen on a label or annotation; ii) Curatorial Data: the data appearing on containers, boxes, cabinets and folders which hold the collections; iii) Supplementary Data: the data held separately from the collections in indices, archives and literature. Each of these categories of data have fundamentally different properties within the digitisation framework which have implications for the data capture process. These properties were assessed in relation to alternative data entry workflows and methodologies to create a more efficient and accurate system of data capture. We see a clear benefit in the prioritisation of curatorial data in the data capture process. These data are often only available at the cabinets, they are in a format suitable for allowing rapid data entry, and they result in an accurate cataloguing of the collections. Finally, the capture of a high resolution digital image enables additional data entry to be separated into multiple sweeps, and optical character recognition (OCR) software can be used to facilitate sorting images for fuller data entry, and giving potential for more automated data entry in the future.


Itinerario ◽  
2000 ◽  
Vol 24 (2) ◽  
pp. 146-169 ◽  
Author(s):  
Michael Leroy Oberg

In August of 1587 Manteo, an Indian from Croatoan Island, joined a group of English settlers in an attack on the native village of Dasemunkepeuc, located on the coast of present-day North Carolina. These colonists, amongst whom Manteo lived, had landed on Roanoke Island less than a month before, dumped there by a pilot more interested in hunting Spanish prize ships than in carrying colonists to their intended place of settlement along the Chesapeake Bay. The colonists had hoped to re-establish peaceful relations with area natives, and for that reason they relied upon Manteo to act as an interpreter, broker, and intercultural diplomat. The legacy of Anglo-Indian bitterness remaining from Ralph Lane's military settlement, however, which had hastily abandoned the island one year before, was too great for Manteo to overcome. The settlers found themselves that summer in the midst of hostile Indians.


1999 ◽  
Vol 173 ◽  
pp. 327-338 ◽  
Author(s):  
J.A. Fernández ◽  
T. Gallardo

AbstractThe Oort cloud probably is the source of Halley-type (HT) comets and perhaps of some Jupiter-family (JF) comets. The process of capture of Oort cloud comets into HT comets by planetary perturbations and its efficiency are very important problems in comet ary dynamics. A small fraction of comets coming from the Oort cloud − of about 10−2− are found to become HT comets (orbital periods < 200 yr). The steady-state population of HT comets is a complex function of the influx rate of new comets, the probability of capture and their physical lifetimes. From the discovery rate of active HT comets, their total population can be estimated to be of a few hundreds for perihelion distancesq <2 AU. Randomly-oriented LP comets captured into short-period orbits (orbital periods < 20 yr) show dynamical properties that do not match the observed properties of JF comets, in particular the distribution of their orbital inclinations, so Oort cloud comets can be ruled out as a suitable source for most JF comets. The scope of this presentation is to review the capture process of new comets into HT and short-period orbits, including the possibility that some of them may become sungrazers during their dynamical evolution.


Sign in / Sign up

Export Citation Format

Share Document