Skip to content
@CACCHT

CACCHT

CACCHT: Creating Annotated Corpora of Classical Hebrew Texts

The CACCHT project is a collaboration of Martijn Naaijer (University of Zurich), Willem van Peursen (Vrije Universiteit Amsterdam), Oliver Glanz (Andrews University), Christian Canu Højgaard (Fjellhaug International University College), Martin Ehrensvärd (University of Copenhagen) and Robert Rezetko (University of Copenhagen).
Together with specialists in the field we develop linguistically annotated datsets of Semitic texts. These datasets are publicly available and can be used freely for research and education. Some datasets have only word-level annotations, while others also contain syntactic features.

Datasets

We are working on the following datasets:

Text-Fabric

All the datasets are Text-Fabric datasets and can be accessed and used with Python.

BHSA

There is an important role for the Biblia Hebraica Stuttgartensia Amstelodamensis (BHSA) in this project. The BHSA is the dataset of the Masoretic Text of the Hebrew Bible with linguistic annotations that is developed and maintained by the ETCBC. In general, CACCHT follows the annotation conventions of the BHSA and we adapt them for the specific characteristics of a language or text.

Popular repositories Loading

  1. caccht.github.io caccht.github.io Public

    Overview of the CACCHT project

  2. .github .github Public

    Overview of the CACCHT project

  3. dss dss Public

    Forked from ETCBC/dss

    Dead Sea Scrolls in TF format based on Abegg's data

    Jupyter Notebook

  4. cuc cuc Public

    Forked from DT-UCPH/cuc

    Contains a text fabric dataset of the Ugaritic corpus.

    HCL

  5. sp sp Public

    Forked from DT-UCPH/sp

    Dataset of the Samaritan Pentateuch

    HCL

  6. syriac syriac Public

    Forked from ETCBC/syriac

    Text-Fabric dataset of Syriac texts

    HCL

Repositories

Showing 6 of 6 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…