Home

Awesome

<a href="https://github.com/centre-for-humanities-computing/Dacy"><img src="https://github.com/centre-for-humanities-computing/DaCy/raw/main/docs/_static/icon_black_text.png" width="175" height="175" align="right" /></a>

DaCy: An efficient and unified framework for danish NLP

PyPI pip downloads Python Version Ruff documentation Tests

<!-- start short-description -->

DaCy is a Danish natural language preprocessing framework made with SpaCy. Its largest pipeline has achieved State-of-the-Art performance on Named entity recognition, part-of-speech tagging and dependency parsing for Danish. Feel free to try out the demo. This repository contains material for using DaCy, reproducing the results and guides on the usage of the package. Furthermore, it also contains behavioral tests for biases and robustness of Danish NLP pipelines.

<!-- end short-description -->

🔧 Installation

You can install dacy via pip from PyPI:

pip install dacy

👩‍💻 Usage

To use the model you first have to download either the small, medium, or large model. To see a list of all available models:

import dacy
for model in dacy.models():
    print(model)
# ...
# da_dacy_small_trf-0.2.0
# da_dacy_medium_trf-0.2.0
# da_dacy_large_trf-0.2.0

To download and load a model simply execute:

nlp = dacy.load("da_dacy_medium_trf-0.2.0")
# or equivalently (always loads the latest version)
nlp = dacy.load("medium")

To see more examples, see the documentation.

📖 Documentation

Documentation
📚 Getting startedGuides and instructions on how to use DaCy and its features.
🦾 PerformanceA detailed description of the performance of DaCy and comparison with similar Danish models
📰 News and changelogNew additions, changes and version history.
🎛 API ReferencesThe detailed reference for DaCy's API. Including function documentation
🙋 FAQFrequently asked questions
<br /> <details> <summary> Training and reproduction </summary>

The folder training contains a range of folders with a SpaCy project for each model version. This allows for the reproduction of the results.

Want to learn more about how DaCy initially came to be, check out this blog post.

</details> <br />

💬 Where to ask questions

To report issues or request features, please use the GitHub Issue Tracker. Questions related to SpaCy are kindly referred to the SpaCy GitHub or forum. Otherwise, please use the Discussion Forums.

Type
📚 FAQFAQ
🚨 Bug ReportsGitHub Issue Tracker
🎁 Feature Requests & IdeasGitHub Issue Tracker
👩‍💻 Usage QuestionsGitHub Discussions
🗯 General DiscussionGitHub Discussions