CARLoS: Retrieval via Concise Assessment Representation of LoRAs at Scale
By: Shahar Sarfaty , Adi Haviv , Uri Hacohen and more
Potential Business Impact:
Finds the best AI image tools for any task.
The rapid proliferation of generative components, such as LoRAs, has created a vast but unstructured ecosystem. Existing discovery methods depend on unreliable user descriptions or biased popularity metrics, hindering usability. We present CARLoS, a large-scale framework for characterizing LoRAs without requiring additional metadata. Analyzing over 650 LoRAs, we employ them in image generation over a variety of prompts and seeds, as a credible way to assess their behavior. Using CLIP embeddings and their difference to a base-model generation, we concisely define a three-part representation: Directions, defining semantic shift; Strength, quantifying the significance of the effect; and Consistency, quantifying how stable the effect is. Using these representations, we develop an efficient retrieval framework that semantically matches textual queries to relevant LoRAs while filtering overly strong or unstable ones, outperforming textual baselines in automated and human evaluations. While retrieval is our primary focus, the same representation also supports analyses linking Strength and Consistency to legal notions of substantiality and volition, key considerations in copyright, positioning CARLoS as a practical system with broader relevance for LoRA analysis.
Similar Papers
AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation
CV and Pattern Recognition
Lets computers create many different pictures easily.
LoRAtorio: An intrinsic approach to LoRA Skill Composition
CV and Pattern Recognition
Combines many art styles to create new pictures.
Rank-1 LoRAs Encode Interpretable Reasoning Signals
Machine Learning (CS)
Makes AI smarter with tiny, understandable changes.