-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix: Match Presto's behavior for invalid UTF-8 in url_decode (#11604)
Summary: Pull Request resolved: #11604 Presto Java converts the URL to a Java String after decoding it in url_decode. Java replaces bytes in an invalid UTF-8 character with 0xEF 0xBF 0xBD. Velox decodes invalid UTF-8 characters as is, which leads to differences in results from Java and C++. This diff adds a check when decoding URLs for invalid UTF-8 characters and does the same replacement as Java. Reviewed By: xiaoxmeng Differential Revision: D66249265 fbshipit-source-id: d6993f8c70e72368b85b4c64814a45a1359f0802
- Loading branch information
1 parent
bfaa9a3
commit 9ba0197
Showing
4 changed files
with
154 additions
and
46 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters