Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > linux.debian.maint.java > #12431
| From | Markus Koschany <apo@debian.org> |
|---|---|
| Newsgroups | linux.debian.bugs.dist, linux.debian.devel, linux.debian.maint.java |
| Subject | Bug#1018100: ITP: liblanguage-detector-java -- Language Detection Library for Java |
| Date | 2022-08-25 19:10 +0200 |
| Message-ID | <EYowF-31lB-1@gated-at.bofh.it> (permalink) |
| Organization | linux.* mail to news gateway |
Cross-posted to 3 groups.
Package: wnpp
Severity: wishlist
Owner: Markus Koschany <apo@debian.org>
X-Debbugs-Cc: debian-devel@lists.debian.org, apo@debian.org,debian-java@lists.debian.org
* Package name : liblanguage-detector-java
Version : 0.6
Upstream Author : Nakatani Shuyo, Francois ROLAND, Fabian Kessler,
Nicole Torres, Robert Theis
* URL : https://github.com/optimaize/language-detector
* License : Apache-2.0
Programming Lang: Java
Description : Language Detection Library for Java
This software uses language profiles which were created based on
common text for each language. N-grams, a contiguous sequence of n
items from a given sample of text, were then extracted from that text
and stored in the profiles. When trying to figure out in what
language a certain text is written, the program goes through the same
process: It creates the same kind of n-grams of the input text. Then
it compares the relative frequency of them, and finds the language
that matches best. Currently 71 languages are supported.
Back to linux.debian.maint.java | Previous | Next | Find similar
Bug#1018100: ITP: liblanguage-detector-java -- Language Detection Library for Java Markus Koschany <apo@debian.org> - 2022-08-25 19:10 +0200
csiph-web