A couple of feedparser bugs fixed

March 19, 2008 – 18:19

If you use Python’s feedparser heavily, sooner or later you’ll encounter a feed with non-ASCII characters that fails with a traceback ending in something like

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 10: ordinal not in range(128)

or even

IndexError: pop from empty list
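For illustration, both messages can be reproduced in plain Python without feedparser at all. The sketch below is not feedparser's code; the helper names and the sample bytes are made up for the example, and it assumes Python 3, where the ASCII decode has to be requested explicitly (older Python 2 code could hit it implicitly when mixing byte strings and unicode strings).

```python
# Illustrative only: triggers the same two exception types outside feedparser.
# The helper names and sample data are hypothetical.

# Ten ASCII bytes followed by UTF-8 encoded "e with acute accent" (0xc3 0xa9).
raw = b"feed title\xc3\xa9"


def decode_error_message(data):
    """Return the UnicodeDecodeError text for an ASCII decode, or None."""
    try:
        data.decode("ascii")
    except UnicodeDecodeError as exc:
        return str(exc)
    return None


def pop_error_message(stack):
    """Return the IndexError text for popping from the list, or None."""
    try:
        stack.pop()
    except IndexError as exc:
        return str(exc)
    return None


print(decode_error_message(raw))
print(pop_error_message([]))
```

Decoding the same bytes as UTF-8 instead (raw.decode("utf-8")) succeeds, which is why the first error usually points to text being treated as ASCII somewhere along the way.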

Using the latest version from svn helps, but even that has some bugs (1, 2) in microformats parsing.

Since I use feedparser actively on chytach, I tend to follow feedparser svn and fix bugs quickly.

In case you need them, I’ve opened a git repository with the fixes on Gitorious.

An RPM with the fixes is on its way to ALT Linux Sisyphus.

  • http://utvbloggen.se Jonas

    Cool! Thanks. I will grab a fresh version and try!

  • http://www.babbledog.com Gregg :omd

    It seems like there are a lot of fixes that aren’t making it into the tree at feedparser’s bug forum. Maybe Mark is too busy over at big G to look after the project anymore?

    Gregg

  • Andrey Khavryuchenko

    @Gregg:

    Judging from the number of open defects, it might well be. But if someone cares to submit a patch for a defect (or point to a ticket/post with that), I’ll happily integrate it into my git.