Package: rawdog
Version: 2.10.dfsg.1-1
Severity: important
Tags: patch
While parsing various RSS feeds, rawdog has been failing more and more often
with the error shown below. Installing version 2.11 from the author's web site,
http://offog.org/code/rawdog.html, resolved the problem. The error:
Traceback (most recent call last):
  File "/usr/bin/rawdog", line 31, in <module>
    launch()
  File "/usr/bin/rawdog", line 24, in launch
    sys.exit(main(sys.argv[1:]))
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1489, in main
    rawdog.write(config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1371, in write
    self.write_output_file(articles, article_dates, config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1305, in write_output_file
    self.write_article(f, article, config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1165, in write_article
    description = detail_to_html(entry_info[key], False, config, force_preformatted)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 151, in detail_to_html
    return sanitise_html(html, detail["base"], inline, config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 79, in sanitise_html
    html = feedparser._resolveRelativeURIs(html, baseurl, "UTF-8")
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1633, in _resolveRelativeURIs
    p.feed(htmlSource)
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1480, in feed
    sgmllib.SGMLParser.feed(self, data)
  File "/usr/lib/python2.5/sgmllib.py", line 99, in feed
    self.goahead(0)
  File "/usr/lib/python2.5/sgmllib.py", line 133, in goahead
    k = self.parse_starttag(i)
  File "/usr/lib/python2.5/sgmllib.py", line 291, in parse_starttag
    self.finish_starttag(tag, attrs)
  File "/usr/lib/python2.5/sgmllib.py", line 333, in finish_starttag
    self.unknown_starttag(tag, attrs)
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1628, in unknown_starttag
    _BaseHTMLProcessor.unknown_starttag(self, tag, attrs)
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1497, in unknown_starttag
    value = unicode(value, self.encoding)
  File "/usr/lib/python2.5/encodings/utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode bytes in position 22-23: invalid data
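
For illustration only, the failure in the last frames can be reproduced in
isolation: a strict UTF-8 decode of an attribute value raises as soon as the
bytes are not valid UTF-8. The sketch below is written for a current Python 3
interpreter rather than the Python 2.5 shown in the traceback, and the helper
name decode_attr is hypothetical. I have not checked how 2.11 actually avoids
the error, so this is not the upstream fix, only one possible way of
tolerating bad bytes.

```python
def decode_attr(value, encoding="utf-8"):
    """Hypothetical helper: decode an attribute value without aborting.

    Assumption: substituting U+FFFD for undecodable bytes is acceptable
    for feed content. In Python 2 the rough equivalent of the fallback
    would be unicode(value, encoding, "replace").
    """
    if isinstance(value, str):
        # Already text, nothing to decode.
        return value
    try:
        # Strict decoding, as in the failing unicode(value, self.encoding).
        return value.decode(encoding)
    except UnicodeDecodeError:
        # Keep going, replacing each invalid sequence with U+FFFD.
        return value.decode(encoding, "replace")


# A byte string that is Latin-1, not UTF-8 (0xE9 is an invalid sequence here).
bad_value = b"caf\xe9 x"

try:
    bad_value.decode("utf-8")
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True

tolerant = decode_attr(bad_value)
```

With this input the strict decode raises UnicodeDecodeError, while decode_attr
returns text with the offending byte replaced, so one malformed attribute in a
feed would no longer stop the whole run.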