Hierarchical Re-Classification: Combining Animal Classification Models with Vision Transformers
By: Hugo Markoff, Jevgenijs Galaktionovs
Potential Business Impact:
Identifies animals by species, not just groups.
State-of-the-art animal classification models like SpeciesNet provide predictions across thousands of species but use conservative rollup strategies, resulting in many animals labeled at high taxonomic levels rather than species. We present a hierarchical re-classification system for the Animal Detect platform that combines SpeciesNet EfficientNetV2-M predictions with CLIP embeddings and metric learning to refine high-level taxonomic labels toward species-level identification. Our five-stage pipeline (high-confidence acceptance, bird override, centroid building, triplet-loss metric learning, and adaptive cosine-distance scoring) is evaluated on a segment of the LILA BC Desert Lion Conservation dataset (4,018 images, 15,031 detections). After recovering 761 bird detections from "blank" and "animal" labels, we re-classify 456 detections labeled animal, mammal, or blank with 96.5% accuracy, achieving species-level identification for 64.9%
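To make the centroid-building and cosine-distance stages concrete, here is a minimal sketch of the general idea, not the authors' implementation. It assumes each detection already has an embedding vector (in the paper these come from CLIP; toy 2-D vectors are used here). The function names, the fixed `max_distance` threshold, and the `min_margin` check are hypothetical stand-ins: the abstract does not say how the adaptive scoring is computed, and the metric-learning stage is omitted entirely.

```python
import math


def normalize(v):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def build_centroids(labeled):
    """Build one unit-length centroid per species.

    `labeled` maps a species name to a list of embedding vectors taken
    from detections that were already accepted with high confidence.
    """
    centroids = {}
    for species, vectors in labeled.items():
        unit = [normalize(v) for v in vectors]
        mean = [sum(col) / len(unit) for col in zip(*unit)]
        centroids[species] = normalize(mean)
    return centroids


def cosine_distance(a, b):
    """Cosine distance between two unit-length vectors."""
    return 1.0 - sum(x * y for x, y in zip(a, b))


def reclassify(embedding, centroids, max_distance=0.3, min_margin=0.05):
    """Assign the nearest species centroid, or None if the match is weak.

    Both thresholds are illustrative assumptions: a label is assigned only
    when the nearest centroid is close enough and clearly closer than the
    runner-up. Otherwise the detection keeps its high-level label.
    """
    query = normalize(embedding)
    ranked = sorted(
        (cosine_distance(query, centroid), species)
        for species, centroid in centroids.items()
    )
    best_distance, best_species = ranked[0]
    margin = ranked[1][0] - best_distance if len(ranked) > 1 else float("inf")
    if best_distance <= max_distance and margin >= min_margin:
        return best_species, best_distance
    return None, best_distance


# Toy usage: two species with two accepted detections each.
centroids = build_centroids({
    "lion": [[1.0, 0.0], [0.9, 0.1]],
    "zebra": [[0.0, 1.0], [0.1, 0.9]],
})
label, distance = reclassify([0.95, 0.05], centroids)
ambiguous_label, _ = reclassify([1.0, 1.0], centroids)
```

In this toy setup the first query lies close to the lion centroid and is assigned to it, while the second is equidistant from both centroids and is left unassigned. A real system would tune the thresholds per taxon or per dataset rather than fix them globally.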