Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > linux.debian.maint.java > #12431

Bug#1018100: ITP: liblanguage-detector-java -- Language Detection Library for Java

From Markus Koschany <apo@debian.org>
Newsgroups linux.debian.bugs.dist, linux.debian.devel, linux.debian.maint.java
Subject Bug#1018100: ITP: liblanguage-detector-java -- Language Detection Library for Java
Date 2022-08-25 19:10 +0200
Message-ID <EYowF-31lB-1@gated-at.bofh.it> (permalink)
Organization linux.* mail to news gateway

Cross-posted to 3 groups.

Show all headers | View raw


Package: wnpp
Severity: wishlist
Owner: Markus Koschany <apo@debian.org>
X-Debbugs-Cc: debian-devel@lists.debian.org, apo@debian.org,debian-java@lists.debian.org

* Package name    : liblanguage-detector-java
  Version         : 0.6
  Upstream Author : Nakatani Shuyo, Francois ROLAND, Fabian Kessler,
                    Nicole Torres, Robert Theis
* URL             : https://github.com/optimaize/language-detector
* License         : Apache-2.0
  Programming Lang: Java
  Description     : Language Detection Library for Java

This software uses language profiles which were created based on
common text for each language. N-grams, a contiguous sequence of n
items from a given sample of text, were then extracted from that text
and stored in the profiles. When trying to figure out in what
language a certain text is written, the program goes through the same
process: It creates the same kind of n-grams of the input text. Then
it compares the relative frequency of them, and finds the language
that matches best. Currently 71 languages are supported.

Back to linux.debian.maint.java | Previous | Next | Find similar


Thread

Bug#1018100: ITP: liblanguage-detector-java -- Language Detection Library for Java Markus Koschany <apo@debian.org> - 2022-08-25 19:10 +0200

csiph-web