author | StefanVukovic99 <stefanvukovic44@gmail.com> | 2024-05-22 22:45:39 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-05-22 20:45:39 +0000 |
commit | d19b898792bffed8ab2d5724472e5b65a5f5b146 | |
tree | b3e0d5111d748dfcc5d74d9dbf68e79193fa6a7f /test/data/translator-test-inputs.json | |
parent | 125cde3d98c18b08e71e075b4a9776fc7bd4b4a0 |
[ja] add preprocessor for width of alphabetic characters (#964)
* add japanese text preprocessor for variants in width of alphabetic characters
* try combining with numeric to improve performance
* Update ext/js/language/ja/japanese.js
Co-authored-by: Kuuuube <61125188+Kuuuube@users.noreply.github.com>
Signed-off-by: StefanVukovic99 <stefanvukovic44@gmail.com>
* Update ext/js/language/ja/japanese.js
Co-authored-by: Kuuuube <61125188+Kuuuube@users.noreply.github.com>
Signed-off-by: StefanVukovic99 <stefanvukovic44@gmail.com>
* fix tests
---------
Signed-off-by: StefanVukovic99 <stefanvukovic44@gmail.com>
Co-authored-by: Kuuuube <61125188+Kuuuube@users.noreply.github.com>
Diffstat (limited to 'test/data/translator-test-inputs.json')
-rw-r--r-- | test/data/translator-test-inputs.json | 28 |
1 files changed, 27 insertions, 1 deletions
diff --git a/test/data/translator-test-inputs.json b/test/data/translator-test-inputs.json
index 9f508a1d..ed028273 100644
--- a/test/data/translator-test-inputs.json
+++ b/test/data/translator-test-inputs.json
@@ -369,7 +369,33 @@
             ]
         },
         {
-            "name": "Test text preprocessors - convertAlphabeticCharacters",
+            "name": "Test text preprocessors - alphabeticWidthVariants 1",
+            "func": "findTerms",
+            "mode": "split",
+            "text": "English",
+            "options": [
+                "default",
+                {
+                    "type": "terms",
+                    "removeNonJapaneseCharacters": false
+                }
+            ]
+        },
+        {
+            "name": "Test text preprocessors - alphabeticWidthVariants 2",
+            "func": "findTerms",
+            "mode": "split",
+            "text": "USB",
+            "options": [
+                "default",
+                {
+                    "type": "terms",
+                    "removeNonJapaneseCharacters": false
+                }
+            ]
+        },
+        {
+            "name": "Test text preprocessors - alphabeticToHiragana",
             "func": "findTerms",
             "mode": "split",
             "text": "utsu",
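Note for readers of this diff: the new test cases exercise a preprocessor that produces width variants of alphabetic characters, and the commit message says this was combined with numeric handling. The sketch below illustrates that general idea only. It is not the code added to ext/js/language/ja/japanese.js. The function names and the fixed offset approach are assumptions made for illustration, relying on the Unicode fact that fullwidth forms U+FF01..U+FF5E mirror ASCII U+0021..U+007E.

```javascript
// Illustrative sketch only; all names here are hypothetical and are not
// taken from the commit.

// Distance between an ASCII character and its fullwidth counterpart
// (for example 'A' U+0041 and fullwidth 'A' U+FF21).
const FULLWIDTH_OFFSET = 0xFEE0;

/** Convert halfwidth ASCII letters and digits to their fullwidth forms. */
function toFullWidthAlphanumeric(text) {
    return text.replace(/[0-9A-Za-z]/g, (c) => (
        String.fromCodePoint(c.codePointAt(0) + FULLWIDTH_OFFSET)
    ));
}

/** Convert fullwidth letters and digits back to halfwidth ASCII. */
function toHalfWidthAlphanumeric(text) {
    return text.replace(/[\uFF10-\uFF19\uFF21-\uFF3A\uFF41-\uFF5A]/g, (c) => (
        String.fromCodePoint(c.codePointAt(0) - FULLWIDTH_OFFSET)
    ));
}

/** Return the input plus its halfwidth and fullwidth variants, without duplicates. */
function sketchWidthVariants(text) {
    return [...new Set([
        text,
        toHalfWidthAlphanumeric(text),
        toFullWidthAlphanumeric(text),
    ])];
}
```

Under these assumptions, `sketchWidthVariants('USB')` yields two strings: the original and its fullwidth form. A dictionary lookup could then try each variant in turn, which is roughly what the two `alphabeticWidthVariants` test inputs appear to check.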