first commit

nlpfun
2021-05-18 00:03:45 +08:00
parent ca6ffedb45
commit 6d284528b4
430 changed files with 1467034 additions and 0 deletions

33691
corpus/bible/en.verse Normal file

File diff suppressed because it is too large

@@ -0,0 +1,7 @@
text_id text_length
001 5000
002 10000
003 15000
004 20000
005 25000
006 30000

5000
corpus/bible/split/001.en Normal file

File diff suppressed because it is too large

5000
corpus/bible/split/001.trans Normal file

File diff suppressed because it is too large

6301
corpus/bible/split/001.zh Normal file

File diff suppressed because it is too large

File diff suppressed because it is too large

10000
corpus/bible/split/002.en Normal file

File diff suppressed because it is too large

10000
corpus/bible/split/002.trans Normal file

File diff suppressed because it is too large

13056
corpus/bible/split/002.zh Normal file

File diff suppressed because it is too large

13056
corpus/bible/split/002.zh.tok Normal file

File diff suppressed because it is too large

15000
corpus/bible/split/003.en Normal file

File diff suppressed because it is too large

15000
corpus/bible/split/003.trans Normal file

File diff suppressed because it is too large

19653
corpus/bible/split/003.zh Normal file

File diff suppressed because it is too large

19653
corpus/bible/split/003.zh.tok Normal file

File diff suppressed because it is too large

20000
corpus/bible/split/004.en Normal file

File diff suppressed because it is too large

20000
corpus/bible/split/004.trans Normal file

File diff suppressed because it is too large

27678
corpus/bible/split/004.zh Normal file

File diff suppressed because it is too large

27678
corpus/bible/split/004.zh.tok Normal file

File diff suppressed because it is too large

25000
corpus/bible/split/005.en Normal file

File diff suppressed because it is too large

25000
corpus/bible/split/005.trans Normal file

File diff suppressed because it is too large

35980
corpus/bible/split/005.zh Normal file

File diff suppressed because it is too large

35980
corpus/bible/split/005.zh.tok Normal file

File diff suppressed because it is too large

30000
corpus/bible/split/006.en Normal file

File diff suppressed because it is too large

30000
corpus/bible/split/006.trans Normal file

File diff suppressed because it is too large

42687
corpus/bible/split/006.zh Normal file

File diff suppressed because it is too large

42687
corpus/bible/split/006.zh.tok Normal file

File diff suppressed because it is too large

5000
corpus/bible/tok/001.en Normal file

File diff suppressed because it is too large

6301
corpus/bible/tok/001.zh Normal file

File diff suppressed because it is too large

10000
corpus/bible/tok/002.en Normal file

File diff suppressed because it is too large

13056
corpus/bible/tok/002.zh Normal file

File diff suppressed because it is too large

15000
corpus/bible/tok/003.en Normal file

File diff suppressed because it is too large

19653
corpus/bible/tok/003.zh Normal file

File diff suppressed because it is too large

20000
corpus/bible/tok/004.en Normal file

File diff suppressed because it is too large

27678
corpus/bible/tok/004.zh Normal file

File diff suppressed because it is too large

25000
corpus/bible/tok/005.en Normal file

File diff suppressed because it is too large

35980
corpus/bible/tok/005.zh Normal file

File diff suppressed because it is too large

30000
corpus/bible/tok/006.en Normal file

File diff suppressed because it is too large

42687
corpus/bible/tok/006.zh Normal file

File diff suppressed because it is too large

47864
corpus/bible/zh.verse Normal file

File diff suppressed because it is too large