Memory contents leak when using invalid utf-8 for searches

Memory contents leak when using invalid utf-8 for searches

Issue ID:2798
Issue Category:security
Component:core
Priority:normal
Status:fixed
Assigned:brion
Version:0.9

Memory contents leak when using invalid utf-8 for searches: See
what appears in the "Keywords" input field:
http//identi.ca/search/notice?q=%E0%81

it seems to actually be a PHP/libxml2 bug, reported upstream:
https://bugs.edge.launchpad.net/php/+bug/655442

Updates

#1

Status:active» fixed

This bad boy resolves it for now by kicking out bad UTF-8 web input:

[ebfa8bc] Basic validation of UTF-8 input via GET/POST vars: invalid UTF-8 sequences will cause the string to drop. Not necessarily super-thorough; should be improved in future to drop individual bad sequences

But if bogus data is floating around elsewhere in the system and gets shoved straight into an XMLWriter, similar could still happen.

Login or Register to modify this issue, or to receive updates by email.

You can also subscribe to the RSS feed for updates to this issue.