


proxy baffement

Started by: Roedy Green <see_website@mindprod.com.invalid>
First post: 2011-11-17 18:31 -0800
Last post: 2011-11-19 13:49 -0800
Articles 13 — 4 participants



Contents

  proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-17 18:31 -0800
    Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-17 19:30 -0800
    Re: proxy baffement Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at> - 2011-11-18 08:11 +0000
      Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-18 02:34 -0800
      Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-18 02:44 -0800
        Re: proxy baffement Gene Wirchenko <genew@ocis.net> - 2011-11-18 09:44 -0800
          Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-18 10:06 -0800
        Re: proxy baffement Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at> - 2011-11-19 11:18 +0000
          Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-21 10:29 -0800
      Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-18 03:12 -0800
    Re: proxy baffement Steven Simpson <ss@domain.invalid> - 2011-11-18 22:06 +0000
    Re: proxy baffement Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at> - 2011-11-19 11:10 +0000
    Re: proxy baffement Roedy Green <see_website@mindprod.com.invalid> - 2011-11-19 13:49 -0800

#10027 — proxy baffement

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-17 18:31 -0800
Subject: proxy baffement
Message-ID: <qefbc75va1lc98u1i4db78eo4ipcmeeeeh@4ax.com>
This is not so much a question about Java as about HTTP.

I wrote a program called Brokenlinks that tests all the links on my
website and tells me about permanent redirects and sites that have
stayed dead for 6+ days.  (It filters out temporary outages).
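
The rule described above could be sketched roughly like this. This is not the actual Brokenlinks code. The class name, the verdict names, the -1 convention for "no response at all", and the exact six-day threshold are assumptions made for illustration.

```java
import java.time.Duration;
import java.time.Instant;

public class LinkVerdict {
    public enum Verdict { ALIVE, MOVED_PERMANENTLY, TEMPORARILY_DOWN, DEAD }

    // Assumed grace period, taken from the "6+ days" mentioned in the post.
    public static final Duration GRACE = Duration.ofDays(6);

    /**
     * @param httpStatus   HTTP response code, or -1 if there was no response (assumed convention)
     * @param failingSince when the link first started failing without recovering, or null
     * @param now          time of the current check
     */
    public static Verdict classify(int httpStatus, Instant failingSince, Instant now) {
        // 301 and 308 are the permanent redirect codes.
        if (httpStatus == 301 || httpStatus == 308) {
            return Verdict.MOVED_PERMANENTLY;
        }
        // Any other 2xx or 3xx answer counts as alive in this sketch.
        if (httpStatus >= 200 && httpStatus < 400) {
            return Verdict.ALIVE;
        }
        // Everything else is a failure; report it as dead only after the grace period.
        Instant start = (failingSince != null) ? failingSince : now;
        boolean longDead = Duration.between(start, now).compareTo(GRACE) >= 0;
        return longDead ? Verdict.DEAD : Verdict.TEMPORARILY_DOWN;
    }
}
```

A link that has been failing for one day would come back as TEMPORARILY_DOWN, and the same link after a week of failures as DEAD.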

Something strange is happening with http://ask.com

My program, written in Java, says it is always dead.  Firefox often
thinks it is alive. 

I fired up Wireshark to see if I could figure out what was going on.

The first thing I noticed was this

GET
/tbproxy/lh/fixurl?hl=en-US&sd=ca&url=http%3A%2F%2Fask.com%2F&sourceid=chrome&error=connectionfailure
HTTP/1.1
Host: linkhelp.clients.google.com

Somehow my request is sometimes going through a Google proxy.  But
WHY? What has Google got to do with this? I am using Firefox, not
Chrome.

Further, I notice Chrome often says "resolving proxy" and dithers for
a long time loading a LOCAL file off my hard disk. Why would a web
proxy be involved at all?  

Opera has been completely unusable for about two years because it
takes about a minute to load a file off local disk (it is fine on the
web).  I thought it had something to do with Google JavaScript fetching
ads and the translate widget slowly, but maybe that too is a proxy problem.  The
Opera people ignored me every time I reported the problem.

I always thought proxies just did a bit of caching, but otherwise you
could ignore them. Yet it seems clear my browser is speaking a
different protocol from usual. It seems to be aware of the proxy's presence.

My questions:

If my IAP is using a proxy, what are the benefits?

Is there a way to sneak around it in case it is screwing things up?

Why Google?  Do they host proxies for IAPs?  Is ASK owned by Google?

Is my Java program supposed to be proxy-aware and do something
different?

-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10030

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-17 19:30 -0800
Message-ID: <97kbc7tb747iohd5kk289qorgtjenretmm@4ax.com>
In reply to: #10027
On Thu, 17 Nov 2011 18:31:51 -0800, Roedy Green
<see_website@mindprod.com.invalid> wrote, quoted or indirectly quoted
someone who said :

>
>Is there a way to sneak around it in case it is screwing things up?

I posted what I have discovered so far at
http://mindprod.com/jgloss/proxy.html

Another puzzle has come up. Where does Windows get its notion of
what proxy to use?  It is not part of the DHCP protocol.  The
docs I have found suggest it has to be told by manually configuring an
IP.  I most definitely did not do that.  I wonder if some Google
product did it.

-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10033

From: Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at>
Date: 2011-11-18 08:11 +0000
Message-ID: <slrnjcc4pi.fvg.avl@gamma.logic.tuwien.ac.at>
In reply to: #10027
Roedy Green <see_website@mindprod.com.invalid> wrote:
> My program, written in Java, says it [ask.com] is always dead.
>  Firefox often thinks it is alive. 
> I fired up Wireshark to see if I could figure out what was going on.
> The first thing I noticed was this
> GET
> /tbproxy/lh/fixurl?hl=en-US&sd=ca&url=http%3A%2F%2Fask.com%2F&sourceid=chrome&error=connectionfailure
> HTTP/1.1
> Host: linkhelp.clients.google.com

The "error=connectionfailure" makes it look like the result of some
other software that gets triggered by failed connect-attempts and
tries to retrieve separate information for the failure.  It could
be a browser-addon, but perhaps could also be software installed
at system level (chrome surely looks quite suspicious for that).

Doesn't wireshark show some SYN packets going to ask.com before this
line?
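
From the Java side, the same question (does a plain TCP connection to the host succeed at all?) could be checked with a small probe like the one below. It is only a sketch. The class and method names are made up, and a successful connect proves only that something completed the TCP handshake, which could be a transparent proxy rather than the site itself.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class ConnectProbe {
    /**
     * Tries to open a TCP connection and reports whether the handshake completed.
     * Returns false on refusal, timeout, or an unresolvable host name.
     */
    public static boolean canConnect(String host, int port, int timeoutMs) {
        try (Socket s = new Socket()) {
            // connect() with a timeout throws SocketTimeoutException instead of blocking indefinitely.
            s.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            return false;
        }
    }
}
```

Calling canConnect("ask.com", 80, 5000) while Wireshark is capturing should show the corresponding SYN going out, which makes it easier to match packets to the program.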

> Opera has been completely unusable for about two years because it
> takes about a minute to load a file off local disk (it is fine on the
> web).

Cannot reproduce that with opera on my machine. (and perhaps the 
opera-people weren't able, either.)  Perhaps you use network-drives
mounted locally?  Or an over-eager virus-scanner, that intercepts
local file reads?  Or such without the "-scanner"...

> Is there a way to sneak around it in case it is screwing things up?

Subscribe to a different one.  But your problem doesn't so far look
like it was the IAP's responsibility.
While they could force a more or less transparent proxy upon all your
connections to the outside world, they surely have no means to interfere
with local file accesses, unless something goes wrong in the browser.



#10036

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-18 02:34 -0800
Message-ID: <q5dcc7lg35jq2o3n8ahtmhar56f859mqcu@4ax.com>
In reply to: #10033
On 18 Nov 2011 08:11:30 GMT, Andreas Leitgeb
<avl@gamma.logic.tuwien.ac.at> wrote, quoted or indirectly quoted
someone who said :

>Doesn't wireshark show some SYN packets going to ask.com before this
>line?

I was only looking at GETs
-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10037

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-18 02:44 -0800
Message-ID: <pjdcc7dsja7odtnfaog2jispgmv5l821b2@4ax.com>
In reply to: #10033
On 18 Nov 2011 08:11:30 GMT, Andreas Leitgeb
<avl@gamma.logic.tuwien.ac.at> wrote, quoted or indirectly quoted
someone who said :

>> Opera has been completely unusable for about two years because it
>> takes about a minute to load a file off local disk (it is fine on the
>> web).
>
>Cannot reproduce that with opera on my machine

It would be helpful if you performed this experiment.

Download a typical page from my website, e.g.
http://mindprod.com/jgloss/wireshark.html

and save it on local hard disk.

Then try loading it with Opera.

If it loads fine that suggests there is something weird with my
machine.
If it is very slow to render, that suggests there is something strange
about my pages.
-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10048

From: Gene Wirchenko <genew@ocis.net>
Date: 2011-11-18 09:44 -0800
Message-ID: <2b6dc7tjrhfmj4n31np0ar5ml2npah58s1@4ax.com>
In reply to: #10037
On Fri, 18 Nov 2011 02:44:08 -0800, Roedy Green
<see_website@mindprod.com.invalid> wrote:

>On 18 Nov 2011 08:11:30 GMT, Andreas Leitgeb
><avl@gamma.logic.tuwien.ac.at> wrote, quoted or indirectly quoted
>someone who said :
>
>>> Opera has been completely unusable for about two years because it
>>> takes about a minute to load a file off local disk (it is fine on the
>>> web).
>>
>>Cannot reproduce that with opera on my machine
>
>It would be helpful if you performed this experiment.
>
>Download a typical page from my website, e.g.
>http://mindprod.com/jgloss/wireshark.html
>
>and save it on local hard disk.
>
>Then try loading it with Opera.
>
>If it loads fine that suggests there is something weird with my
>machine.
>If it is very slow to render, that suggests there is something strange
>about my pages.

     I do not use Opera, but I tried this with Firefox 8.0 under
Windows XP SP 3.  The render from file was fine.

Sincerely,

Gene Wirchenko



#10051

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-18 10:06 -0800
Message-ID: <6d7dc7d3aphbiel7pli3lcalrj666m05o6@4ax.com>
In reply to: #10048
On Fri, 18 Nov 2011 09:44:48 -0800, Gene Wirchenko <genew@ocis.net>
wrote, quoted or indirectly quoted someone who said :

>     I do not use Opera, but I tried this with Firefox 8.0 under
>Windows XP SP 3.  The render from file was fine.

For me Chrome works quite well from local disk.  Opera is impossible,
and Firefox is usually ok, but sometimes takes a long time.

The catch is Chrome can't do Applets, so I have to toggle back and
forth between Chrome and Firefox.  Firefox has a weird stuttering
problem with Java. All is going fine and then it gets it into its head to
reload the page without the Java.  I manually reload the page and it
comes back.  Even when it is working fine, Firefox calls my init
methods twice when a page loads.


The browser people seem to be trying as hard as Microsoft to kill
Java.
-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10066

From: Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at>
Date: 2011-11-19 11:18 +0000
Message-ID: <slrnjcf43n.fvg.avl@gamma.logic.tuwien.ac.at>
In reply to: #10037
Roedy Green <see_website@mindprod.com.invalid> wrote:
> It would be helpful if you performed this experiment.
> Download a typical page from my website, e.g.
> http://mindprod.com/jgloss/wireshark.html
> and save it on local hard disk.

Did that, but opened it in a textviewer instead, and saw a couple
of <script ... src="http://..."> in it.  So much for "local."
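
The same check can be automated for a saved page. The sketch below pulls out every script tag whose src is an absolute http or https URL. The class name is made up, and a regular expression is only an approximation of real HTML parsing, so treat the result as a hint rather than a complete list.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RemoteScripts {
    // Matches <script ... src="http://..."> or src='https://...'; case-insensitive.
    private static final Pattern REMOTE_SRC = Pattern.compile(
            "<script\\b[^>]*?\\bsrc\\s*=\\s*[\"'](https?://[^\"']+)[\"']",
            Pattern.CASE_INSENSITIVE);

    /** Returns the remote script URLs referenced by the given HTML text, in document order. */
    public static List<String> find(String html) {
        List<String> urls = new ArrayList<>();
        Matcher m = REMOTE_SRC.matcher(html);
        while (m.find()) {
            urls.add(m.group(1));
        }
        return urls;
    }
}
```

Any URL this returns is something the browser will still try to fetch over the network when the page is opened from disk.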



#10159

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-21 10:29 -0800
Message-ID: <k06lc793tnn7l18ec5dt9o85atrvnj96lv@4ax.com>
In reply to: #10066
On 19 Nov 2011 11:18:15 GMT, Andreas Leitgeb
<avl@gamma.logic.tuwien.ac.at> wrote, quoted or indirectly quoted
someone who said :

>Did that, but opened it in a textviewer instead, and saw a couple
>of <script ... src="http://..."> in it.  So much for "local."
 
What you saw were Google AdSense ads and a Google Translate widget.

The odd thing is when the same page is loaded from a website it works
fine.  It is WAY slower when loaded off local hard disk.  

This may simply be Google screwing up, or deliberately hosing other
browsers besides Google Chrome which works fairly well.

Yet I think some people don't have trouble with loading from local
hard disk.
-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10038

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-18 03:12 -0800
Message-ID: <d1fcc7tsv1o4el6qonf076gcj82bea2doj@4ax.com>
In reply to: #10033
On 18 Nov 2011 08:11:30 GMT, Andreas Leitgeb
<avl@gamma.logic.tuwien.ac.at> wrote, quoted or indirectly quoted
someone who said :

>Cannot reproduce that with opera on my machine. (and perhaps the 
>opera-people weren't able, either.)  Perhaps you use network-drives
>mounted locally?  Or an over-eager virus-scanner, that intercepts
>local file reads?  Or such without the "-scanner"...

In trying to track this down I turned off dynamic virus checking,
Windows index, Copernic index.  I have turned off most of the
accelerators that load at boot time, e.g. Open Office. This proxy thing
looks like the best lead so far since it is in there without my permission.

There was another anomaly that seems to have gone away.  Timeouts in my
Java HTTP GET code sometimes would not time out. They just sat
there forever.
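
For reference, HttpURLConnection has two separate timeouts, and both default to 0, which means wait indefinitely. If either one is left unset, a stalled server or proxy can hang the request in the way described. A minimal sketch, with a made-up class name and arbitrary timeout values:

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;

public class TimedGet {
    /**
     * Prepares (but does not yet send) a GET with both timeouts set.
     * connectMs limits the TCP connect; readMs limits each blocking read of the response.
     */
    public static HttpURLConnection open(String url, int connectMs, int readMs) throws IOException {
        // openConnection() only creates the object; no network traffic happens until connect().
        HttpURLConnection c = (HttpURLConnection) URI.create(url).toURL().openConnection();
        c.setConnectTimeout(connectMs);
        c.setReadTimeout(readMs);
        c.setRequestMethod("GET");
        return c;
    }
}
```

Calling getResponseCode() on the returned connection then throws java.net.SocketTimeoutException when a limit is exceeded, rather than blocking.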

Maybe it is time for another Windows 7 install to scrape off the
barnacles.

Where does your father keep his barnacles? 
~ One of Charles Darwin’s children to a school friend.
-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 



#10058

From: Steven Simpson <ss@domain.invalid>
Date: 2011-11-18 22:06 +0000
Message-ID: <jf1jp8-6od.ln1@news.simpsonst.f2s.com>
In reply to: #10027
On 18/11/11 02:31, Roedy Green wrote:
> GET
> /tbproxy/lh/fixurl?hl=en-US&sd=ca&url=http%3A%2F%2Fask.com%2F&sourceid=chrome&error=connectionfailure
> HTTP/1.1
> Host: linkhelp.clients.google.com
>
> Somehow my request is sometimes going through a Google proxy.  But
> WHY? What has Google got to do with this? I am using Firefox, not
> Chrome.

Are you sure the request is from your program?  Does User-Agent show it 
to be Java, at least?  You could set it to be more specific, to be sure 
it's /your/ Java program.
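
Setting the header might look something like this. The class name and the User-Agent string are invented for illustration; any distinctive string that is easy to filter on in Wireshark would do.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;

public class IdentifiedRequest {
    // Hypothetical identifier; pick anything that stands out in a packet capture.
    public static final String USER_AGENT = "Brokenlinks-probe/0.1";

    /** Prepares a connection that announces itself with a distinctive User-Agent. */
    public static HttpURLConnection open(String url) throws IOException {
        HttpURLConnection c = (HttpURLConnection) URI.create(url).toURL().openConnection();
        // Must be set before the request is sent.
        c.setRequestProperty("User-Agent", USER_AGENT);
        return c;
    }
}
```

A captured GET that does not carry this string did not come from the program.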

-- 
ss at comp dot lancs dot ac dot uk



#10065

From: Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at>
Date: 2011-11-19 11:10 +0000
Message-ID: <slrnjcf3ls.fvg.avl@gamma.logic.tuwien.ac.at>
In reply to: #10027
Roedy Green <see_website@mindprod.com.invalid> wrote:
> I fired up Wireshark to see if I could figure out what was going on.
> The first thing I noticed was this
> GET
> /tbproxy/lh/fixurl?hl=en-US&sd=ca&url=http%3A%2F%2Fask.com%2F&sourceid=chrome&error=connectionfailure
> HTTP/1.1
> Host: linkhelp.clients.google.com

Another thing: I googled for "tbproxy fixurl" and got hits mostly talking
about some google-script to be used for a site's 404-page.



#10072

From: Roedy Green <see_website@mindprod.com.invalid>
Date: 2011-11-19 13:49 -0800
Message-ID: <0j8gc79dr725b2td6q686o3lvvqlprqbb8@4ax.com>
In reply to: #10027
On Thu, 17 Nov 2011 18:31:51 -0800, Roedy Green
<see_website@mindprod.com.invalid> wrote, quoted or indirectly quoted
someone who said :

>Is my Java program supposed to be proxy-aware and do something
>different?

In poking around to figure out what is going on I discovered a proxy
can be configured in the Windows control panel.  There is also a place
to configure one in the Java Control panel.

I found a note suggesting that if you ticked "automatically detect
settings" in the Windows control panel, you might end up with a proxy.

There are three Java system properties.
System.setProperty( "proxySet", "true" );
System.setProperty( "http.proxyHost", proxyHostName );
System.setProperty( "http.proxyPort", Integer.toString( proxyHostPort ) );

It seems all you need to hook up a proxy is its DNS name or IP.  Even
though there are about 5 different proxy protocols, I gather the
machines sort that out themselves.
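
Besides the JVM-wide system properties, java.net also lets a program choose a proxy per connection, or bypass one, and ask which proxy the JVM would pick by default. The sketch below shows those three calls. The class and method names are made up. Note that in this API the proxy type (HTTP or SOCKS) is stated explicitly rather than negotiated.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Proxy;
import java.net.ProxySelector;
import java.net.URI;
import java.net.URLConnection;
import java.util.List;

public class ProxyChoice {
    /** Describes an HTTP proxy at the given host and port without resolving the name yet. */
    public static Proxy httpProxy(String host, int port) {
        return new Proxy(Proxy.Type.HTTP, InetSocketAddress.createUnresolved(host, port));
    }

    /** Opens a connection through the given proxy; pass Proxy.NO_PROXY to force a direct connection. */
    public static URLConnection openVia(String url, Proxy proxy) throws IOException {
        return URI.create(url).toURL().openConnection(proxy);
    }

    /** Lists the proxies the JVM's default selector would try for this URL. */
    public static List<Proxy> jvmDefaultFor(String url) {
        return ProxySelector.getDefault().select(URI.create(url));
    }
}
```

Printing jvmDefaultFor("http://ask.com/") shows whether the JVM believes a proxy applies. By default the JVM does not consult the operating system's proxy settings unless the java.net.useSystemProxies system property is set to true at startup, so a proxy configured in the Windows control panel would not necessarily affect a plain Java program.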

I also discovered inside Google Chrome in "under the hood" you can
change proxy settings. However, I think that is just a hook into the
Windows control panel.


-- 
Roedy Green Canadian Mind Products
http://mindprod.com
I can't come to bed just yet. Somebody is wrong on the Internet. 




