nazca #198624 sentence delimitation upon single capital character followed by a dot does not make sense [validation pending]
E.g. splitting "Bonjour M. Obama !" into ["Bonjour M.", "Obama !"] does not make much sense.
Probably a regex tweak.
See https://www.logilab.org/ticket/174014
priority | normal |
---|---|
type | bug |
done in | 0.7.0 |
load | 0.500 |
load left | 0.000 |
closed by | #30af4456d4b0 [utils] Use sentences delimiter from NLTK |
patch | [utils] Use sentences delimiter from NLTK [applied] |
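
The behaviour reported above, and the kind of regex tweak the description hints at, can be illustrated with a minimal sketch. The function names and both patterns here are hypothetical and are not taken from the nazca code base; the fix that was actually applied switched to the sentence delimiter from NLTK rather than patching a regex.

```python
import re

# Naive rule (assumed for illustration): a sentence ends at '.', '!' or '?'
# followed by whitespace.
NAIVE_BOUNDARY = re.compile(r"(?<=[.!?])\s+")

# Tweaked rule (hypothetical): same as above, but do not split when the
# punctuation is a dot directly preceded by a single ASCII capital letter
# that starts a word, as in "M." or "J.".
TWEAKED_BOUNDARY = re.compile(r"(?<=[.!?])(?<!\b[A-Z]\.)\s+")


def split_naive(text):
    """Split on every whitespace run that follows '.', '!' or '?'."""
    return NAIVE_BOUNDARY.split(text)


def split_tweaked(text):
    """Like split_naive, but keep single-capital abbreviations attached."""
    return TWEAKED_BOUNDARY.split(text)


if __name__ == "__main__":
    sample = "Bonjour M. Obama !"
    print(split_naive(sample))    # reproduces the split from the ticket
    print(split_tweaked(sample))  # keeps the sentence in one piece
```

The tweak trades one error for another: a sentence that really ends with a single capital letter ("Choose plan A. Then continue.") is no longer split, and accented capitals are not covered by `[A-Z]`. A trained tokenizer such as the one in NLTK handles abbreviations more generally, which is presumably why the patch took that route.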