How to save web pages for offline reading?
Will Stuyvesant
hwlgw at hotmail.com
Tue Jul 22 11:19:38 EDT 2003
More information about the Python-list mailing list
Tue Jul 22 11:19:38 EDT 2003
- Previous message (by thread): How to save web pages for offline reading?
- Next message (by thread): How to save web pages for offline reading?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
> [Carsten Gehling] > Well since you ARE on Windows: > > Open the page in Internet Explorer, choose "File" and "Save As...". > You've now saved all necessary files. > I know. But I can't do File - Save As from python :-) I guess it can be done via COM? > > I thought this whole thing would be easy with all those Python > > internet modules in the standard distro: httplib, urllib, urllib2, > > FancyURLxxx etc. Being able to download a "complete" page *from > > Python source* would be very nice for my particular application. > > Well it's doable with those libraries, but you have to put your own meat > on > the bones. > > 1) Use httplib to get the page first. > 2) Parse it for all "src" attributes, and get the supporting files. The > parsin can be done with a html-parser ... That would be htmllib. What you describe is what I am going to do actually, when I have time again. I was about to do it when I thought "somebody must have been doing this before". It seems like mr. Pillai in another reply has done something similar but I couldn't figure out from his source code. Thank you all for the help! -- Agree with them now, it will save so much time.
- Previous message (by thread): How to save web pages for offline reading?
- Next message (by thread): How to save web pages for offline reading?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Python-list mailing list