idTenTen – Indonesian corpus from the web
idTenTen: Corpus of the Indonesian Web The Indonesian Web Corpus (idTenTen) is an Indonesian corpus made up of texts collected…
If you are not happy with the results below please do another search
idTenTen: Corpus of the Indonesian Web The Indonesian Web Corpus (idTenTen) is an Indonesian corpus made up of texts collected…
…Tagset Indonesian tagset is available in Indonesian corpora annotated by the tool TreeTagger (with the Indonesian parameter file) developed by…
idWaC: Indonesian web corpus The Indonesian web corpus (idWaC) is an Indonesian corpus made up of texts collected from the…
…the CQL concordance search box: [tag=”” & morph=””] searches for cardinal numerals Indonesian and Malaysian_Previous morphology – Apertium Source http://wiki.apertium.org/wiki/Indonesian_and_Malaysian/Previous_morphology…
…words Indonesian Web (IndonesianWaC) trial 90,120,046 Indonesian Web 2020 (idTenTen20) main 3,687,192,045 Indonesian Web 2024 (idTenTen24) trial 7,108,841,939 OpenSubtitles 2018…
…vietnamese, turkish, chinese-traditional, hindi, telugu, czech, finnish, croatian, italian, swedish, danish, indonesian, chinese-simplified, malayalam, bengali, spanish, estonian, german, arabic, hebrew,…
…2023 (huTenTen23) 3,494,350,960 Icelandic Icelandic Web 2020 (isTenTen20) 518,620,759 Igbo Igbo Web 2015 (IgboWaC15) 331,042 Indonesian Indonesian Web 2024 (idTenTen24)…
…(igTenTen17) Igbo trial 629,294 Indonesian Web (IndonesianWaC) Indonesian trial 90,120,046 Indonesian Web 2020 (idTenTen20) Indonesian main 3,687,192,045 Indonesian Web 2024…
…Galician, Georgian, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Malayalam, Norwegian, Persian…
…German tagsets Greek tagsets Hebrew tagset Hindi tagset Hungarian tagsets Indonesian tagset Irish tagset Italian tagset Japanese tagsets Korean tagset…
…deWaC (sdeWaC)), Greek (gkWaC), Gujarati (guWaC) H Hausa (haWaC ), Hebrew (hebWaC), Hindi (hindiWaC) I Igbo (igWaC), Indonesian (idWaC), Italian…
…I Indonesian, Igbo, Sichuan Yi, Iloko, Ingush, Icelandic, Italian J Japanese, Machame, Javanese K Kartuli (Georgian), Kabyle, Kaje (Jju), Kamba,…
…huTenTen (Hungarian web corpus) idTenTen (Indonesian web corpus) isTenTen (Icelandic web corpus) itTenTen (Italian web corpus) jaTenTen (Japanese web corpus)…
…English tagsets Estonian tagsets Finnish tagsets French tagsets German tagsets Greek tagsets Hebrew tagsets Hindi tagset Hungarian tagsets Indonesian tagset…
…ⓧ Indonesian ✓ ✓ full ⓧ Irish ✓ ✓ full ⓧ Italian ✓ ✓ full ✓ Japanese ✓ ✓ full…