Learn More

LeafMachine2

Post-Publication Updates!

March 25, 2024: We have released a new and improved Plant Component Detector (PCD)! This is the YOLOv5 object detection model responsible for locating leaves, flowers, fruits, roots, and other plant components.

What changed? Since publication we have quadrupled the size of our training dataset. The new PCD is trained on 20,000+ full-resolution images and 600,000+ ground truth bounding box annotations. In the paper we describe a few situations where the original PCD struggled. One was a small-leaf bias, so the new training dataset adds more large-leaf taxa, including:

  • Alismataceae Sagittaria
  • Araceae Monstera
  • Sapindaceae Acer
  • Fagaceae Quercus
  • Bignoniaceae Catalpa
  • Phellinaceae Phelline
  • Piperaceae Piper
  • Platanaceae Platanus
  • Melastomataceae Blakea
  • Rubiaceae Posoqueria
  • Bixaceae
  • Pentaphragmataceae
  • Torricelliaceae
  • Aponogetonaceae Aponogeton
  • Euphorbiaceae Macaranga

The new training dataset also includes the following images to improve generalizability across taxa and scenarios:

  • 1,100+ bryophytes from the Oregon State University herbarium, courtesy of Dr. James Mickley.
  • 9,000+ images from the LeafSnap dataset to bolster performance for single-leaf images. Please see LeafSnap.com.
  • 500+ field images (using FieldPrism photogrammetric templates), courtesy of Dr. Chuck Cannon, Claire Henley, and Arnan Pawawongsak, from the Morton Arboretum.

The new PCD is called LeafPriority. It is optimized for locating components associated with woody perennials, including leaves, flowers, fruits, and roots, but has been retrained to handle the 'specimen' class differently. The 'specimen' class detects plant material when typical broad leaves are absent, as in a bryophyte image. LeafPriority improves all downstream machine learning components of LeafMachine2 (segmentation and landmarking) and also improves the SpecimenCrop feature, which automates the tedious process of removing whitespace from specimen images.
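
For intuition, here is a minimal Python sketch of the idea behind SpecimenCrop, not the actual LeafMachine2 implementation: take the union of the detected component bounding boxes and crop the image to that region plus a small margin. The function name and margin value are illustrative assumptions.

```python
# Sketch: crop a specimen image to the union of detected component
# bounding boxes plus a margin (illustrative, not LeafMachine2's code).
from PIL import Image

def specimen_crop(image_path, boxes, margin=25):
    """boxes: list of (x_min, y_min, x_max, y_max) detections in pixels."""
    img = Image.open(image_path)
    x0 = max(min(b[0] for b in boxes) - margin, 0)
    y0 = max(min(b[1] for b in boxes) - margin, 0)
    x1 = min(max(b[2] for b in boxes) + margin, img.width)
    y1 = min(max(b[3] for b in boxes) + margin, img.height)
    return img.crop((x0, y0, x1, y1))

# Hypothetical usage with two detected leaf boxes:
# cropped = specimen_crop("specimen.jpg", [(120, 300, 900, 1400), (850, 200, 1600, 1100)])
# cropped.save("specimen_cropped.jpg")
```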

What did we learn? For LeafPriority training we removed all ground truth bounding boxes for the 'specimen' class whenever whole or partial leaves were also present in the image. So while the original dataset contained more than 25,000 ground truth examples of the 'specimen' class, LeafPriority was trained on only 2,000. Because the 'specimen' class originally encompassed all other plant classes, the original model learned to associate very large bounding boxes (boxes that occupy most of the image) only with the 'specimen' class. This made it much less likely to correctly identify very large leaves or leaves that fill most of the field of view, a consequence of the YOLOv5 model's reliance on anchor boxes to anticipate object locations.
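
That curation rule is simple to express in code. Below is a hedged sketch assuming YOLO-format label files (one `class x_center y_center width height` line per box); the class indices and paths are illustrative, not the actual values used to train LeafPriority.

```python
# Sketch: drop 'specimen' ground-truth boxes from YOLO-format label files
# whenever whole or partial leaf boxes are also present in the same image.
from pathlib import Path

SPECIMEN_ID = 8    # hypothetical class index for 'specimen'
LEAF_IDS = {0, 1}  # hypothetical indices for whole and partial leaves

def curate_label_file(path: Path) -> None:
    lines = [l for l in path.read_text().splitlines() if l.strip()]
    classes = {int(l.split()[0]) for l in lines}
    if classes & LEAF_IDS:  # leaves present: remove every 'specimen' box
        kept = [l for l in lines if int(l.split()[0]) != SPECIMEN_ID]
        path.write_text("\n".join(kept) + "\n")

for label_file in Path("labels/train").glob("*.txt"):
    curate_label_file(label_file)
```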

How can I use the new model? New installations of LeafMachine2 download the new model by default. If you already have LeafMachine2 installed, the new version should download automatically after you run a `git pull` to update your local repo. In the LeafMachine2 GUI, LeafPriority is the new default. If you edit the LeafMachine2.yaml file manually, we include instructions for selecting the new or old model.
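
If you prefer to script that edit, a minimal PyYAML sketch follows. The key name and model identifier below are assumptions for illustration only, not LeafMachine2's actual configuration keys, so defer to the instructions bundled with the repo.

```python
# Sketch: switch the configured detector in LeafMachine2.yaml.
import yaml

with open("LeafMachine2.yaml") as f:
    cfg = yaml.safe_load(f)

# 'plant_component_detector' and the model name are illustrative
# assumptions, not the real keys; consult the repo's instructions.
cfg["plant_component_detector"] = "LeafPriority"

with open("LeafMachine2.yaml", "w") as f:
    yaml.safe_dump(cfg, f, sort_keys=False)
```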

What's next? We have collected 5x the number of leaf segmentations since publication, so stay tuned for an updated segmentation model! We also have a pose detection model trained to identify landmarks, which significantly improves upon the existing landmark detection algorithm.

Photo PCD LeafPriority

Comparing the old PCD to the new LeafPriority PCD: large leaves are much more likely to be detected. Most of the old PCD's training data were densely packed herbarium specimens, so it actually performed better on dense, overlapping specimens and struggled with isolated leaves. LeafPriority no longer places 'specimen' bounding boxes (the black boxes) around isolated leaves, significantly improving detection accuracy, as the Ginkgo biloba and Castanea dentata images show. The bryophyte images show a huge improvement: LeafPriority places a single 'specimen' box around the material instead of the messy original detections.

Abstract

From leaves to labels: Building modular machine learning networks for rapid herbarium specimen analysis with LeafMachine2

Premise: Quantitative plant traits play a crucial role in biological research. However, traditional methods for measuring plant morphology are time-consuming and have limited scalability. We present LeafMachine2, a suite of modular machine learning and computer vision tools that can automatically extract a base set of leaf traits from digital plant datasets.

Methods: LeafMachine2 was trained on 494,766 manually prepared and expert-reviewed annotations from 5,597 herbarium images obtained from 288 institutions, representing 2,663 species, and employs object detection and segmentation algorithms to isolate individual leaves and petioles. Our landmarking network identifies and measures nine pseudo-landmarks that occur on most broadleaf taxa. Archival processing algorithms prepare labels for optical character recognition and interpretation, while reproductive organs are scored.

Results: LeafMachine2 can extract trait data from at least 245 angiosperm families and calculate pixel-to-metric conversion factors for 26 commonly used ruler types.

Discussion: LeafMachine2 is a highly efficient tool for generating large quantities of plant trait data, even from occluded or overlapping leaves, field images, and non-archival datasets. Our project, along with similar initiatives, has made significant progress in removing the bottleneck in plant trait data acquisition from herbarium specimens and has shifted the focus toward the crucial task of data revision and quality control.

Cite LeafMachine2

Weaver, W. N., and S. A. Smith. 2023. From leaves to labels: Building modular machine learning networks for rapid herbarium specimen analysis with LeafMachine2. Applications in Plant Sciences. e11548. doi:10.1002/aps3.11548

Cite LeafMachine

Weaver, W. N., J. Ng, and R. G. Laport. 2020. LeafMachine: Using machine learning to automate leaf trait extraction from digitized herbarium specimens. Applications in Plant Sciences 8(6): e11367. doi:10.1002/aps3.11367

Related Work: Cite FieldPrism

Weaver, W. N., and S. A. Smith. 2023. FieldPrism: A system for creating snapshot vouchers from field images using photogrammetric markers and QR codes. Applications in Plant Sciences 11(5): e11545. doi:10.1002/aps3.11545

Related Work: Cite VoucherVision (AJB)

Weaver, W. N., B. R. Ruhfel, K. J. Lough, and S. A. Smith. 2023. Herbarium specimen label transcription reimagined with large language models: capabilities, productivity, and risks. American Journal of Botany. doi:10.1002/ajb2.16256

Related Work: Cite VoucherVision (BISS)

Weaver, W. N., K. Lough, S. A. Smith, and B. R. Ruhfel. 2023. The Future of Natural History Transcription: Navigating AI advancements with VoucherVision and the Specimen Label Transcription Project (SLTP). Biodiversity Information Science and Standards 7: e113067. doi:10.3897/biss.7.113067

Try VoucherVision!

VoucherVision is a multi-institution collaboration with the goal of significantly automating specimen label transcription using Large Language Models.

Or try it on VoucherVision's Hugging Face Space


Submit Logs

Submit a log file for assistance.

Publication Figures

Photo 1

Figure 1: LeafMachine2 workflow. A batch of images is processed by the PCD (2) and ACD (6) component detection networks. (2) Bounding boxes identifying predicted plant components. Each bounding box identifies a unique component, directing it to the appropriate processing pipeline. (3) The PCD produces cropped images of each plant component. (4) Individual cropped leaves undergo instance segmentation by the Detectron2 network, producing leaf outline masks for ideal leaves (green) and optionally partial leaves (blue). The first set of images shows individual leaves, while the second set shows the compilation of the individual leaves back onto the full specimen image. (5) Cropped ideal leaves are processed by the PLD and individual landmarks are measured. Please see Figure 2 for a description of each landmark annotation. (6) Bounding boxes identifying predicted archival components. (7) Cropped archival components from the ACD are processed and cleaned into binary images for downstream applications, like optical character recognition (OCR) or interpretation by Large Language Models (LLMs). (8) The cropped ruler image is processed by our scanline or template matching algorithms to identify unit markers. Located tick marks are shown as colored dots. Green and cyan lines indicate the converted one- and five-centimeter distances for quality control purposes. For more information about pixel-to-metric conversion, please see Appendices S2 and S3. (9) The final overlay image shows all machine-derived masks, measurements, and identified components. All the visuals in this figure are sourced directly from the output files produced by LeafMachine2.
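
To make step (8) concrete, here is a simplified illustration of a scanline approach to tick detection, not LeafMachine2's actual algorithm: binarize one row of the cropped ruler, find the background-to-tick transitions, and take the median spacing as the pixel-per-unit conversion factor.

```python
# Sketch: estimate a ruler conversion factor from one binarized scanline.
import numpy as np

def scanline_cf(binary_row: np.ndarray) -> float:
    """binary_row: 1D array of 0/1 pixels, 1 where a tick mark is present."""
    # Indices where the row transitions from background (0) to tick (1).
    edges = np.flatnonzero(np.diff(binary_row.astype(int)) == 1)
    if len(edges) < 2:
        raise ValueError("too few tick marks to estimate spacing")
    return float(np.median(np.diff(edges)))  # pixels per tick interval

row = np.zeros(200, dtype=int)
row[10::20] = 1           # synthetic ticks every 20 pixels
print(scanline_cf(row))   # -> 20.0 pixels/mm, if adjacent ticks are 1 mm apart
```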

Photo 2

Figure 2: Qualitative performance of LeafMachine2 by family and task. Qualitative performance of LeafMachine2 across 341 plant families, as identified by the home herbaria. We visually inspected LeafMachine2’s quality control summary images for the 831 species/images in the D-3FAM test dataset, produced with default settings and a PCD confidence of 50%. We followed a power-ranking scheme to assign qualitative ratings to families with more than one image, conservatively rounding down in the case of split ratings between images. For leaf segmentation, a “good” rating indicates that most leaf masks are high quality, a “marginal” rating indicates that usable masks are present but require manual filtering, and a “poor” rating indicates that no usable masks are present. For landmarks, a “good” rating indicates that at least one usable and accurate landmark skeleton was present, a “marginal” rating indicates that only partial landmark skeletons were present, and a “poor” rating means that no landmarks could be identified. For component identification, a “good” rating means that LeafMachine2 scored the presence of all non-laminar organs, but not necessarily all instances of each organ. A “marginal” rating indicates that some non-laminar organs were not identified, while “poor” means that LeafMachine2 misidentified or failed to identify most non-laminar organs. Bolded families were included in the LeafMachine2 training dataset. (A) An image of Lauraceae Umbellularia californica as an example of “good” ratings in all categories. (B) An image of Myricaceae Morella cerifera as an example of “marginal” ratings in all categories. (C) An image of Sarcobataceae Sarcobatus vermiculatus as an example of “poor” ratings in all categories.

Photo 3

Figure 3: Leaf detection with archival and non-archival datasets, with varying PCD confidence. The left column is the original image. From left to right, full-image masks of ideal leaves (or leaflets) are ordered by decreasing PCD confidence. (A) Herbarium voucher of Fagaceae Quercus coccinea. (B) Herbarium voucher of Apodanthaceae Pilostyles blanchetii. (C) Herbarium voucher of Stilbaceae Brookea tomentosa. (D) FieldPrism-processed field image of Fagaceae Quercus havardii, courtesy of the Morton Arboretum. (E) Leaf scan image of Sapindaceae Koelreuteria paniculata. (F) iNaturalist-style photograph of Nyssaceae Nyssa sylvatica, photo credit William Weaver.

Photo 4

Figure 4: Segmentation and pseudo-landmark examples. All leaves are from the D-3FAM dataset and were not part of the segmentation or landmarking training datasets. Ideal leaves, as predicted by the PCD, are green masks, while partial leaves are blue masks. (Leaves A-Q) A sample of leaves demonstrating segmentation performance when leaves have complex outlines, are obstructed by mounting tape, overlap other leaves, or face a combination of obstructions, notably leaves L, P, and Q. (Leaves R-V) A sample of leaves showing pseudo-landmark performance. For landmark overlay images, the red line is the lamina width, the cyan line traces the petiole, the solid black line traces the midvein, the dotted white line is the line of best fit for the points that comprise the midvein, the solid white line is the base-to-tip length, blue bullseye points are lobe tips, pink angles are less than 180 degrees, orange angles are reflex angles greater than 180 degrees, the green dot is the lamina tip, and the solitary red dot is the lamina base. Green bounding boxes are the minimal rotated bounding box. Petioles are either pink or orange masks and holes are purple. Leaf V shows bounding boxes around fruit and buds.

Photo 5

Figure 5: Ruler conversion performance. (A) The 37 ruler types that our ruler classifier was trained to recognize, arranged from best performing to worst, left to right. Rulers 30-37 are block-based rulers that can be identified but not yet converted; they are well suited to our template-matching procedures and will be supported in future iterations. The colored boxes below each ruler correspond to the conversion factor (CF) determination success rate within the R-CLASS dataset. The numerator is the proportion visually assessed to be a correct conversion based on the quality control output (see Appendix S2, images 1-38), and the denominator is the total number of rulers of that class present in R-CLASS. Rulers with a zero can be identified by the ruler classifier but were not present in R-CLASS. Colored shape identifiers are placed above each ruler image for the ruler classes present in both the R-CLASS and D-3FAM datasets. (B) A t-test between manually obtained CFs and autonomously generated CFs for 708 rulers in the test dataset D-3FAM. The y-value of each point is the percent difference from the manually converted CF (left y-axis). Points are sorted by autonomous CF pooled standard deviation, lower values to the left and higher values to the right (right y-axis). Inconsistently converted rulers have higher index values; consistent rulers have lower index values. Accurate autonomous conversions fall between the average RSD dotted lines. The two recommended ruler types (rulers 2 and 7) are denoted by green star-shaped markers.
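
As a worked example of the metric plotted in panel B, the sketch below computes the percent difference of several autonomous CF estimates from a manual CF, with a per-ruler sample standard deviation as a simple stand-in for the pooled value; all numbers and variable names are illustrative.

```python
# Sketch: percent difference of autonomous CFs from a manual CF.
import numpy as np

manual_cf = 118.0                           # pixels per cm, measured by hand
auto_cfs = np.array([117.2, 119.5, 118.4])  # autonomous estimates, same ruler

percent_diff = 100.0 * (auto_cfs - manual_cf) / manual_cf
spread = auto_cfs.std(ddof=1)               # used to sort rulers by consistency

print(percent_diff.round(2), round(spread, 2))
```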

Feature Roadmap

What's coming in future releases!
Training Dataset v.1.0

This dataset was used to train all LeafMachine2 algorithms. For more details, please see the 2023 publication.

Training Dataset v.1.1

A larger training dataset with more taxonomic diversity. Highlights include:

  • Over twice as many training images for the landmark detector
  • More than 1,000 additional species
  • Support for additional plant organs and measurements
Module: Armature Detector

Our first add-on module will be an armature detector capable of locating and measuring prickles, thorns, and spines.

Modules are designed to be more taxonomically focused, extending the base detection and measurement capabilities of LeafMachine2.

Integrated FieldPrism Support

FieldPrism creates curated snapshot vouchers for quantitative trait collection. This release will apply FieldPrism image-processing methods when LeafMachine2 detects a FieldPrism FieldSheet in an image, streamlining the workflow.

Check out FieldPrism!