ENH: support bytearray and unicode (#3) by westurner · Pull Request #4 · kosqx/better-bencode

westurner · 2016-11-13T19:26:43Z

ENH: support {bytearray, unicode} #3
TST: manual test bencode.dumps with Python 2
TST: manual test bencode.dumps with Python 3
TST: test that a .torrent file at least loads (with Transmission)
- is this handling of utf8 unicode strings consistent with which BEP?
  - https://wiki.theory.org/BitTorrentSpecification#Metainfo_File_Structure
    " All character string values are UTF-8 encoded."
TST: write test cases for bytearrays
TST: write test cases for unicodes
PRF: port to _fast.c

notpushkin · 2016-11-16T12:32:19Z

Fails on Python 3, NameError: name 'unicode' is not defined :(

westurner · 2016-11-16T13:53:45Z

Whoops. I think that should only be for Python 2 because in Python 3 all
str are unicode.

On Wednesday, November 16, 2016, Alexander Pushkov notifications@github.com
wrote:

Fails on Python 3, NameError: name 'unicode' is not defined :(

—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
#4 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/AADGy77ug9XCD5PdA461pQNqM3AtSJzmks5q-vfTgaJpZM4Kwxlh
.

notpushkin · 2016-11-16T14:27:08Z

The error actually happens in the elif t is unicode line. We can safely substitute unicode for str (in py3k) here (as unicode_to_bytes accepts str in python 3), so I suggest this simple fix: Songbee@0bb114b

As suggested by Songbee@0bb114b

westurner · 2016-11-17T16:22:02Z

Good call @iamale

note: this passes w/ _pure.py but not w/ _fast.c

westurner · 2016-11-17T17:04:08Z

TST: What should I add to TEST_DATA to test for unicode strings?
ENH,TST: IDK Python C-API well enough to port this from _pure.py to _fast.c

westurner · 2016-11-17T17:06:51Z

... TIL "All character string values are UTF-8 encoded." https://wiki.theory.org/BitTorrentSpecification#Metainfo_File_Structure

westurner · 2016-11-17T17:09:15Z

    (b'd3:bar4:spam3:fooi42ee', {b'bar': b'spam', b'foo': 42}),
    (b'd1:ai1e1:bi2e1:ci3ee', {b'a': 1, b'b': 2, b'c': 3}),
    (b'd1:a1:be', {b'a': b'b'}),
+    (b'3:\x00\x01\x02', bytearray([0, 1, 2])),


is this correct?

notpushkin · 2016-11-17T17:13:22Z

Also I think it would be nice to decode unicode strings to unicode/str as well (although it might be a bit trickier).

ENH: _pure.py: support bytearray and unicode (kosqx#3)

62d4d89

BUG: _pure.py: unicode = str for py3k

f457423

As suggested by Songbee@0bb114b

westurner added 2 commits November 17, 2016 10:27

TST: test_bencode.py: remove u'' from TESTS_TYPEERROR

7803814

TST: test_bencode.py: add bytearrays to TEST_DATA

1a8d2dc

note: this passes w/ _pure.py but not w/ _fast.c

westurner commented Nov 17, 2016

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: support bytearray and unicode (#3)#4

ENH: support bytearray and unicode (#3)#4
westurner wants to merge 4 commits into
kosqx:masterfrom
westurner:feature/bytearray_and_unicode

westurner commented Nov 13, 2016 •

edited

Loading

Uh oh!

notpushkin commented Nov 16, 2016

Uh oh!

westurner commented Nov 16, 2016

Uh oh!

notpushkin commented Nov 16, 2016

Uh oh!

westurner commented Nov 17, 2016

Uh oh!

westurner commented Nov 17, 2016

Uh oh!

westurner commented Nov 17, 2016

Uh oh!

westurner Nov 17, 2016 •

edited

Loading

Uh oh!

notpushkin commented Nov 17, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

westurner commented Nov 13, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

notpushkin commented Nov 16, 2016

Uh oh!

westurner commented Nov 16, 2016

Uh oh!

notpushkin commented Nov 16, 2016

Uh oh!

westurner commented Nov 17, 2016

Uh oh!

westurner commented Nov 17, 2016

Uh oh!

westurner commented Nov 17, 2016

Uh oh!

westurner Nov 17, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

notpushkin commented Nov 17, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

westurner commented Nov 13, 2016 •

edited

Loading

westurner Nov 17, 2016 •

edited

Loading