Update README.md

bfsujason
2021-05-18 01:00:21 +08:00
committed by GitHub
parent 0df79e3139
commit 62e801b0bf

@@ -9,3 +9,9 @@ Firstly, 5 chapters and their translations are sampled from each of the 6 novels
The **MAC-Test** is saved in [corpus/mac/test](./corpus/mac/test)
The sampling schemes for building **MAC-Test** can be found at [corpus/mac/test/meta_data.tsv](./corpus/mac/test/meta_data.tsv)
There are 4 subdirectories in **MAC-Test**. The [split](/corpus/mac/test/split) directory contains the sentence-split source texts, the target texts, and the machine translations of the source texts, all of which **Bleualign** requires to perform automatic alignment.
The inputs to **Hunalign** are saved in the [tok](/corpus/mac/test/tok) directory.
The emb directory contains the overlapping sentences and their embeddings, which serve as the inputs to **Vecalign** and **BertAlign**.
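
As a rough illustration of what "overlapping sentences" can mean here, the sketch below concatenates runs of consecutive sentences up to a fixed window size. It is not the script used to build the emb directory: the function name `build_overlaps`, the `max_size` parameter, and the space-joined output are all assumptions made for this example, and the actual tooling may order, normalize, or deduplicate the overlaps differently.

```python
def build_overlaps(sentences, max_size=3):
    """Return unique concatenations of 1..max_size consecutive sentences.

    Hypothetical helper for illustration only; it is not taken from
    this repository or from any of the aligners.
    """
    seen = set()
    overlaps = []
    for size in range(1, max_size + 1):
        # Slide a window of `size` sentences over the document.
        for start in range(len(sentences) - size + 1):
            text = " ".join(sentences[start:start + size])
            if text not in seen:  # keep each overlap string only once
                seen.add(text)
                overlaps.append(text)
    return overlaps


if __name__ == "__main__":
    for line in build_overlaps(["A.", "B.", "C."], max_size=2):
        print(line)
```

Each resulting string would then be embedded once, so that an aligner can look up the vector for any candidate group of consecutive sentences without re-encoding it.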