Is there a RegEx spellchecker for Albanian?
It would help us on Wikipedia.
Arianit
Is there a RegEx spellchecker for Albanian?
It would help us on Wikipedia.
Arianit
I am not sure a regec spellchecker is what we need. Perhaps https://en.wikipedia.org/wiki/Levenshtein_distance is better. In any case, I am not aware of something that we could use. But, perhaps we can build it.
Some tools are using it. It takes energy to build it from scratch.
Hi again Arianit,
do you think you could describe exactly what it is you need. You say “some tools are using it”. What tools, and what is “it” that the tools are using. can you please describe the process you need to automate? https://en.wikibooks.org/wiki/Algorithm_Implementation/Strings/Levenshtein_distance is already implemented. If this boils down to just using wiring input to an existing implementation of the algorithm, we can perhaps do that quickly.
Let me know,
Gagi
Could you comment further on what a regex spellchecker is or why you
need it? If you are fine with ignoring the context of words (and thus
grammar), you could write something like this except with more words.
/^(?:a|abaci|abacia)$/
Then tokenize the words, and any word that does not match the regular
expression would be considered misspelled. Of course, at this point you
it doesn't need to be a regular expression.
Need it for AutoWikiBrowser
https://en.wikipedia.org/wiki/Wikipedia:AutoWikiBrowser/Typos
There is an Albanian GPL dictionary here http://www.shkenca.org/k6i/
Thanks to Ngadhnjim for starting the list https://sq.wikipedia.org/wiki/Wikipedia:AutoWikiBrowser/Typos
If you have others, feel free to add.
Cheers!