All Tools
D
DataFreeOpen Source
DOCCANO
Open-source text annotation platform for NLP datasets
MIT
ABOUT
Building high-quality NLP datasets requires teams to annotate text with labels, entities, and relationships, but most annotation workflows rely on spreadsheets, custom scripts, or expensive proprietary tools. Doccano gives teams a free, web-based platform where they can upload data, define labeling schemas, collaborate with annotators, and export labeled datasets without writing any annotation infrastructure code.
INSTALL
pip install doccanoINTEGRATION GUIDE
1. Label named entities in text documents for training NER models with custom entity types
2. Classify text passages by sentiment, topic, or intent to build text classification datasets
3. Annotate sequence-to-sequence pairs for summarization, translation, or paraphrase training data
4. Coordinate annotation projects across multiple human labelers with role-based project management
5. Export annotated datasets in multiple formats for downstream machine learning pipelines
TAGS
annotationdata-labelingnlptext-classificationnamed-entity-recognitionsequence-labelingopen-sourcepython