Message88864
| Author | momat |
|---|---|
| Recipients | cpalmer, fantoozler, fdrake, georg.brandl, gsf, momat |
| Date | 2009-06-04.06:33:54 |
| Message-id | <1244097253.39.0.143495255584.issue670664@psf.upfronthosting.co.za> |
| In-reply-to | |
| Content | |
|---|---|
A simple workaround for BeautifulSoup is the following wrapper. It
sanitizes the JavaScript code before passing it to the parser by joining
the split string literals, so that "</scr"+"ipt>" becomes "</script>".
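import copy
import re
from BeautifulSoup import BeautifulSoup

def bs(input):
    # Match the "+" sequence that joins two adjacent string literals.
    pattern = re.compile(r'"\+"')
    # Replace each match with nothing, which merges the two literals.
    match = lambda x: ""
    massage = copy.copy(BeautifulSoup.MARKUP_MASSAGE)
    massage.extend([(pattern, match)])
    return BeautifulSoup(input, markupMassage=massage) |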
|
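The joining step can also be tried on its own, without BeautifulSoup. The following is only a minimal sketch using the standard `re` module; the function name `join_split_literals` and the sample string are made up for illustration. It assumes the literals are double-quoted and joined by a bare `+` with no surrounding whitespace, which is all the pattern in the wrapper handles.

```python
import re

# Same pattern as in the wrapper: closing quote, plus sign, opening quote.
SPLIT_LITERAL = re.compile(r'"\+"')


def join_split_literals(markup):
    """Remove every "+" join so adjacent double-quoted literals merge.

    Hypothetical helper, not part of BeautifulSoup.
    """
    return SPLIT_LITERAL.sub("", markup)


# Made-up sample of the kind of script content that trips the parser.
sample = 'document.write("<scr"+"ipt src=x.js></scr"+"ipt>");'
print(join_split_literals(sample))
```

Note that a join written with spaces (`"a" + "b"`) or with single quotes is left untouched by this pattern, so the workaround only covers the exact `"+"` form.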
| History | |||
|---|---|---|---|
| Date | User | Action | Args |
| 2009-06-04 06:34:15 | momat | set | recipients: + momat, fdrake, georg.brandl, fantoozler, gsf, cpalmer |
| 2009-06-04 06:34:13 | momat | set | messageid: <1244097253.39.0.143495255584.issue670664@psf.upfronthosting.co.za> |
| 2009-06-04 06:34:08 | momat | link | issue670664 messages |
| 2009-06-04 06:34:00 | momat | create | |