Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.java.programmer > #19457

(extra) parsing (and verifying of) URL's via java.net.URI ...

From qwertmonkey@syberianoutpost.ru
Newsgroups comp.lang.java.programmer
Subject (extra) parsing (and verifying of) URL's via java.net.URI ...
Date 2012-10-21 14:45 +0000
Organization Aioe.org NNTP Server
Message-ID <k611qi$ftq$1@speranza.aioe.org> (permalink)

Show all headers | View raw


~ 
 as stated in the API
~ 
 http://docs.oracle.com/javase/7/docs/api/java/net/URI.html
~ 
 this would be an extra feature, but it would be nice if as part of the API
 you could tell apart what is a TLD, as well as suffixes that some regional
 registrar authorities mark as not publicly available. For example, there 
 are lots of extra suffixes the Japanese apparently guard (basically all cities
 and prefectures):
~ 
 ".ac.jp", ".abashiri.hokkaido.jp", ...
~ 
 Noregeans do the same thing: ".fylkesbibl.no", ".gs.[county].no", ...
~ 
 I think in the states they have similar restrictions regarding tlds and 
 suffixes such as ".edu" and ".mil" and the French even guard their dentists to
 the point of pairing them with surgeons: ".chirurgiens-dentistes.fr" ;-)
~ 
 Those parsing issues are important in use cases in which you want to know
 what is in a host name:
~ 
$ host www.dos.state.fl.us
www.dos.state.fl.us has address 207.156.20.19

$ host dos.state.fl.us
dos.state.fl.us has address 207.156.20.19
dos.state.fl.us mail is handled by 10 mail.dos.state.fl.us.
dos.state.fl.us mail is handled by 20 xm.dos.state.fl.us.

$ host state.fl.us
state.fl.us mail is handled by 10 dohsmsi07.doh.state.fl.us.
state.fl.us mail is handled by 10 dohsmsi06.doh.state.fl.us.

$ host doh.state.fl.us
doh.state.fl.us has address 199.250.17.86
doh.state.fl.us mail is handled by 1 mx0003.doh.state.fl.us.
doh.state.fl.us mail is handled by 1 mx5201.doh.state.fl.us.
doh.state.fl.us mail is handled by 1 mx0001.doh.state.fl.us.
doh.state.fl.us mail is handled by 1 mx0002.doh.state.fl.us.
 
$ host fl.us
$
~ 
 subdepartments inside of a University department may have their own
 website (or not)
~ 
$ host cornell.edu
cornell.edu has address 128.253.173.242
cornell.edu has address 128.253.173.243
cornell.edu has address 128.253.173.244
cornell.edu has address 128.253.173.245
cornell.edu has address 128.253.173.246
cornell.edu has address 128.253.173.241
cornell.edu mail is handled by 10 router2.mail.cornell.edu.
cornell.edu mail is handled by 10 router3.mail.cornell.edu.
cornell.edu mail is handled by 10 router4.mail.cornell.edu.
cornell.edu mail is handled by 10 router9.mail.cornell.edu.
cornell.edu mail is handled by 10 router10.mail.cornell.edu.
cornell.edu mail is handled by 10 router1.mail.cornell.edu.

knoppix@Microknoppix:~$ host www.cs.cornell.edu
www.cs.cornell.edu is an alias for www1.cs.cornell.edu.
www1.cs.cornell.edu has address 128.84.154.137

knoppix@Microknoppix:~$ host www.cornell.edu
www.cornell.edu is an alias for wwwcornelledu-ssl.cit.cornell.edu.
wwwcornelledu-ssl.cit.cornell.edu has address 132.236.204.10
~ 
 There is lots of info scattered all over the Internet about web naming issues
~ 
 http://www.cs.cornell.edu/people/egs/papers/dnssurvey.pdf
~ 
 but AFAIK there is not central registry of such data. Do you know better?
~ 
 thanks
 lbrtchx
 comp.lang.java.programmer: (extra) parsing (and verifying of) URL's via java.net.URI ...

Back to comp.lang.java.programmer | Previous | NextNext in thread | Find similar | Unroll thread


Thread

(extra) parsing (and verifying of) URL's via java.net.URI ... qwertmonkey@syberianoutpost.ru - 2012-10-21 14:45 +0000
  Re: (extra) parsing (and verifying of) URL's via java.net.URI ... markspace <-@.> - 2012-10-21 09:43 -0700
  Re: (extra) parsing (and verifying of) URL's via java.net.URI ... Arne Vajhoej <arne@vajhoej.dk> - 2012-10-21 13:53 -0400
  Re: (extra) parsing (and verifying of) URL's via java.net.URI ... Roedy Green <see_website@mindprod.com.invalid> - 2012-10-23 04:29 -0700
  Re: (extra) parsing (and verifying of) URL's via java.net.URI ... paul.cager@googlemail.com - 2012-10-23 05:34 -0700

csiph-web