Copenhagen Ugaritic Corpus

This repo contains a text fabric dataset of the Ugaritic text corpus. It is work in progress.

The CACCHT project: Creating Annotated Corpora of Classical Hebrew Text

This dataset is developed as part of the CACCHT project, which is a collaboration of Christian Canu Højgaard, Martijn Naaijer, Martin Ehrensvärd, Robert Rezetko, Oliver Glanz, and Willem van Peursen. The goal of CACCHT is to prepare and publish ancient Semitic texts digitally, that can be used for research.

For this dataset, we cooperate with Tania Notarius (University of the Free State) and Maria Simion, volunteer assistant (Polis - the Jerusalem Institute of Language and Humanities).

Data

The following tablets of Die keilalphabetischen Texte aus Ugarit (KTU) are currently available:

KTU 1.1-1.7
KTU 1.14-1.22
KTU 2.5-2.18
KTU 2.20-2.27
KTU 2.30-2.32
KTU 2.34-2.44
KTU 2.46-2.75
KTU 2.77-2.80
KTU 2.82-2.100

The texts are currently annotated with the following features:

tablet: tablet title
column: column number
line: line number
side: tablet side of inscription
g_cons: a consonantal representation of each word in Latin script
trailer: a representation of word spacing or word dividers
language: Ugaritic
sign: Letter in Latin script
emen: emendations of various sorts in relation to a sign (including reconstructed, missing, excised, or redundant signs/letters)
cert: certainty of the text in relation to a sign (corresponding to the italic of KTU)
cont: marking of line continuation in between lines
alt: alternative reading

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
.github/workflows		.github/workflows
app		app
lexicon_and_grammar		lexicon_and_grammar
morphemes_files		morphemes_files
tests		tests
tf		tf
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Copenhagen Ugaritic Corpus

The CACCHT project: Creating Annotated Corpora of Classical Hebrew Text

Data

About

Uh oh!

Releases

Packages

Languages

CACCHT/cuc

Folders and files

Latest commit

History

Repository files navigation

Copenhagen Ugaritic Corpus

The CACCHT project: Creating Annotated Corpora of Classical Hebrew Text

Data

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages