Get entire web page?
Steve Holden
sholden at holdenweb.com
Fri Nov 3 23:44:04 EST 2000
More information about the Python-list mailing list
Fri Nov 3 23:44:04 EST 2000
- Previous message (by thread): can't find standard libraries after RedHat 7.0 install. Help?
- Next message (by thread): Get entire web page?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Whatdoesthisbuttondo? wrote: > > I'm trying to write a script so that I can do a throughput test for a > firewall/proxy. I know how to grab the text of a page, what I'm not > sure about is how to get all of the referenced graphics. > Any help or pointers existing python code would definitely be > appreciated. > > Thanks > > Sent via Deja.com http://www.deja.com/ > Before you buy. The htmllib module allows you to parse the returned HTML, and you need to handle the images. This means writing a subclass of HTMLParser which overrides (defines its own) handle_image method: the parser will call handle_image each time it processes an image tag. From the 1.5.2 docs: handle_image (source, alt[, ismap[, align[, width[, height]]]]) This method is called to handle images. The default implementation simply passes the alt value to the handle_data() method. It's scary the first time you do it, but turns out to be quite simple if you overcome your fear. Good luck! By the way: DON'T TOUCH THAT BUTTON! ;-) regards Steve -- Helping people meet their information needs with training and technology. 703 967 0887 sholden at bellatlantic.net http://www.holdenweb.com/
- Previous message (by thread): can't find standard libraries after RedHat 7.0 install. Help?
- Next message (by thread): Get entire web page?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Python-list mailing list