Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #31215 > unrolled thread

Understanding http proxies

Started byOlive <diolu@bigfoot.com>
First post2012-10-13 20:43 +0200
Last post2012-10-15 09:53 +0200
Articles 5 — 5 participants

Back to article view | Back to comp.lang.python


Contents

  Understanding http proxies Olive <diolu@bigfoot.com> - 2012-10-13 20:43 +0200
    Re: Understanding http proxies Chris Angelico <rosuav@gmail.com> - 2012-10-14 05:50 +1100
    Re: Understanding http proxies Tim Roberts <timr@probo.com> - 2012-10-14 12:58 -0700
    Re: Understanding http proxies Cameron Simpson <cs@zip.com.au> - 2012-10-15 11:57 +1100
    Re: Understanding http proxies Olive <diolu.remove_this_part@bigfoot.com> - 2012-10-15 09:53 +0200

#31215 — Understanding http proxies

FromOlive <diolu@bigfoot.com>
Date2012-10-13 20:43 +0200
SubjectUnderstanding http proxies
Message-ID<20121013204358.6d400240@pcolivier.chezmoi.net>
I am trying to understand how to build an http proxy server in python,
and I have found the following example:

http://www.oki-osk.jp/esc/python/proxy/

But I do not have found an exact description of what exactly a proxy
server is suppose to do (all references gice only the basic principe of
proxy that I know). In the following model

Client <-> Proxy <-> Server

it seems when I read the code above that the proxy acts mostly as an
orinary server with respect to the client except that it is supposed to
receive the full URL instead of just the path. Am I right? Is there any
documentation on what an http proxy is supposed to implement.

Olive

[toc] | [next] | [standalone]


#31216

FromChris Angelico <rosuav@gmail.com>
Date2012-10-14 05:50 +1100
Message-ID<mailman.2134.1350154234.27098.python-list@python.org>
In reply to#31215
On Sun, Oct 14, 2012 at 5:43 AM, Olive <diolu@bigfoot.com> wrote:
> it seems when I read the code above that the proxy acts mostly as an
> orinary server with respect to the client except that it is supposed to
> receive the full URL instead of just the path. Am I right? Is there any
> documentation on what an http proxy is supposed to implement.

The easiest way to test this is to knock together a quick little
server, set your browser to use localhost as a proxy, and see what
requests you get.

You're correct as regards most requests, but HTTPS is more
complicated. All your proxy will see is a CONNECT request; you have to
accept or deny on the basis of address alone, you don't get the whole
URL (for obvious reasons). But that aside, yes, you'll normally get a
request that looks pretty similar to what the origin server would get.

ahh, happy memories of MUDding through a local proxy that permitted
CONNECT on more ports than 443... and even happier memories of getting
port 23 opened to direct access, hehe...

ChrisA

[toc] | [prev] | [next] | [standalone]


#31257

FromTim Roberts <timr@probo.com>
Date2012-10-14 12:58 -0700
Message-ID<q16m785u5oisfpmlib0dshhbla5ia7c5r5@4ax.com>
In reply to#31215
Olive <diolu@bigfoot.com> wrote:
>
>it seems when I read the code above that the proxy acts mostly as an
>orinary server with respect to the client except that it is supposed to
>receive the full URL instead of just the path. Am I right? Is there any
>documentation on what an http proxy is supposed to implement.

Consider the ways HTTP could have been implemented.  Say we have a request
to get http://www.bigsite.com/pictures/index.html .

One way HTTP could have been implemented is by sending this request to the
server at www.bigsite.com:

    GET /pictures/index.html HTTP/1.0

If that were how HTTP were done, you could not implement a proxy, because
there isn't enough information for any intermediates to know where the
request had to end up.

Instead, http looks like this:

    GET /pictures/index.html HTTP/1.1
    Host: www.bigsite.com

Now, even if this is sent to someone who is not "www.bigsite.com", that
receipient can tell exactly who is supposed to get the message.

So, a web proxy receives requests intended for other sites, and forwards
them on, possibly after restricting or modifying them.  That's it.
-- 
Tim Roberts, timr@probo.com
Providenza & Boekelheide, Inc.

[toc] | [prev] | [next] | [standalone]


#31270

FromCameron Simpson <cs@zip.com.au>
Date2012-10-15 11:57 +1100
Message-ID<mailman.2187.1350262654.27098.python-list@python.org>
In reply to#31215
On 13Oct2012 20:43, Olive <diolu@bigfoot.com> wrote:
| I am trying to understand how to build an http proxy server in python,
| and I have found the following example:
| http://www.oki-osk.jp/esc/python/proxy/
| 
| But I do not have found an exact description of what exactly a proxy
| server is suppose to do (all references gice only the basic principe of
| proxy that I know). In the following model
| 
| Client <-> Proxy <-> Server
| 
| it seems when I read the code above that the proxy acts mostly as an
| orinary server with respect to the client except that it is supposed to
| receive the full URL instead of just the path. Am I right? Is there any
| documentation on what an http proxy is supposed to implement.

As mentioned elsewhere, in HTTP 1.0 you get a full URL in the opening
line.

In HTTP 1.1 you get the path component in the opening line and the host
part in the Host: header of the request.

Have a read of RFC2616 (which defines HTTP 1.0):

  http://tools.ietf.org/html/rfc2616

It has sections on proxies, too, outlining which they must do beyond
what a plain HTTP server must do (not much, but a few things, and there
are proxy-specific authentication fields available too):

  Proxy Servers
  http://tools.ietf.org/html/rfc2616#section-8.1.3

  Proxy Authenticate
  http://tools.ietf.org/html/rfc2616#section-14.33

Cheers,
-- 
Cameron Simpson <cs@zip.com.au>

There's two kinds of climbers...smart ones, and dead ones.      - Don Whillans

[toc] | [prev] | [next] | [standalone]


#31286

FromOlive <diolu.remove_this_part@bigfoot.com>
Date2012-10-15 09:53 +0200
Message-ID<20121015095346.54d68aa5@pcolivier.chezmoi.net>
In reply to#31215
Thank you for all yours answers. There are very usefull!

Olive

[toc] | [prev] | [standalone]


Back to top | Article view | comp.lang.python


csiph-web