← Back to Vision Models LFM2.5-VL-1.6B is Liquid AI’s flagship vision-language model, delivering exceptional performance on image understanding, visual reasoning, and multimodal tasks. Built on LFM2.5 with a dynamic SigLIP2 image encoder.Documentation Index
Fetch the complete documentation index at: https://liquidai-fix-android-sdk-qa-issues.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Specifications
| Property | Value |
|---|---|
| Parameters | 1.6B |
| Context Length | 32K tokens |
| Architecture | LFM2.5-VL (Dense) |
Image Captioning
Detailed descriptions and alt-text
Visual Reasoning
Scene understanding and visual Q&A
OCR & Extraction
Text recognition and document parsing
Quick Start
- Transformers
- vLLM
- SGLang
- llama.cpp