Agents can now:Documentation Index
Fetch the complete documentation index at: https://docs.agno.com/llms.txt
Use this file to discover all available pages before exploring further.
- Turn text, images, audio, video, and PDFs into structured records.
- Assign labels, label sets, taxonomy paths, and labeled spans.
- Score and rank model outputs for evals and preference data.
- Add a reviewer and an adjudicator when label quality matters.
output_schema.
Agno is natively multimodal and type-safe, so the full labeling stack can be built in pure python.
Example
Here’s a quick example classifying reviews into{positive, negative, neutral} using gemini-3.5-flash. The agent outputs a valid Classification object.
cookbook/data_labeling/_01_text_classification/basic.py
Data labeling workflows
Pick the page that matches what you need.| Workload | Input | Output | Page |
|---|---|---|---|
| Data extraction | Any modality | Typed Pydantic object | Data extraction |
| Classification | Any modality | One label, label set, or spans | Classification |
| Scoring / evaluation | Prompt + response | Rubric scores | LLM as judge |
| Preference ranking | Prompt + two responses | Winner + rationale | Preference data |
| Non-text input | Image, audio, video, PDF | Any of the above | Multimodal inputs |
| Reviewed labels | Any input | Adjudicated label + audit trail | Quality pipeline |
Model choice
We usegemini-3.5-flash across the cookbooks because it handles text, image, audio, video, and PDF. Agno is model-agnostic, so you can swap models as needed.
Explore
Data extraction
Turn any modality into a typed object, with optional per-field confidence.
Classification
Single-label, multi-label, hierarchical, and span labeling.
LLM as judge
Score outputs against a rubric. The same machinery, used for evals.
Preference data
Rank A vs B for RLHF and DPO datasets.
Multimodal inputs
Feed images, audio, video, and PDFs into any labeler.
Quality pipeline
Two labelers, a reviewer, and an adjudicator with an audit trail.