STY: Minor code-style improvements for _reader.py #123

coco-speed · 2024-09-18T04:00:13Z

This pull request was created automatically by CodSpeed to track performance changes of the pull request py-pdf/pypdf#2847.

The original branch is upstream/reader-minor-sty

…ayout mode (py-pdf#2788)

* Handle Sequence as an IndirectObject

  The spec allows an int or float to be an IndirectObject as well, but this commit does not address that theoretical possibility.
* Update pypdf/_text_extraction/_layout_mode/_font.py

  Co-authored-by: Stefan <[email protected]>
* Address PR comments
  - Rename w_1 to w_next_entry
  - Utilize ParseError instead of PdfReadError
  - Write a test (both positive and negative)
* Handle unlikely case of IndirectObjects for float/int width elements

  Also adds a comment to clarify that we don't explicitly handle the IndexError exception. Rather, we let it be raised as an IndexError.
* Yoda condition I removed
* Last commit was a bad patch, confused by non-committed changes
* Use test files from URL rather than resources
* Update tests/test_text_extraction.py

  Co-authored-by: pubpub-zz <[email protected]>
* Fix code style warnings in range() call

---------

Co-authored-by: Stefan <[email protected]>
Co-authored-by: pubpub-zz <[email protected]>

test_image_without_pillow runs a generated script which causes the Python path to exclude the current directory. The generated script tries to import pypdf and either cannot find it or it finds the version in pyenv instead of the version being tested. Add "." to PYTHONPATH so the correct version of pypdf is used. Closes py-pdf#2849
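The PYTHONPATH fix described above can be sketched roughly as follows. This is a minimal illustration and not the code from the pypdf test suite; the helper name `env_with_cwd_on_pythonpath` is hypothetical.

```python
import os
from typing import Mapping, Optional


def env_with_cwd_on_pythonpath(base: Optional[Mapping[str, str]] = None) -> dict:
    """Return a copy of the environment with "." prepended to PYTHONPATH.

    Hypothetical helper: a child interpreter started with this environment
    resolves imports against the current working directory first, so a
    generated script imports the checkout under test rather than an
    installed copy of the package.
    """
    env = dict(os.environ if base is None else base)
    existing = env.get("PYTHONPATH", "")
    # Keep whatever was already configured, but put "." in front of it.
    env["PYTHONPATH"] = os.pathsep.join(p for p in (".", existing) if p)
    return env
```

A test could then pass the result as `env=` to `subprocess.run([sys.executable, script_path], env=env_with_cwd_on_pythonpath())`, where `script_path` stands for the generated script.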

## Version 5.0.0, 2024-09-15

This version drops support for Python 3.7 (not maintained since July 2023), PdfMerger (use PdfWriter instead) and AnnotationBuilder (use annotations instead).

### Deprecations (DEP)
- Remove the deprecated PdfMerger and AnnotationBuilder classes and other deprecations cleanup (py-pdf#2813)
- Drop Python 3.7 support (py-pdf#2793)

### New Features (ENH)
- Add capability to remove /Info from PDF (py-pdf#2820)
- Add incremental capability to PdfWriter (py-pdf#2811)
- Add UniGB-UTF16 encodings (py-pdf#2819)
- Accept utf strings for metadata (py-pdf#2802)
- Report PdfReadError instead of RecursionError (py-pdf#2800)
- Compress PDF files merging identical objects (py-pdf#2795)

### Bug Fixes (BUG)
- Fix sheared image (py-pdf#2801)

### Robustness (ROB)
- Robustify .set_data() (py-pdf#2821)
- Raise PdfReadError when missing /Root in trailer (py-pdf#2808)
- Fix extract_text() issues on damaged PDFs (py-pdf#2760)
- Handle images with empty data when processing an image from bytes (py-pdf#2786)

### Developer Experience (DEV)
- Fix coverage uploads (py-pdf#2832)
- Test against Python 3.13 (py-pdf#2776)

[Full Changelog](py-pdf/pypdf@4.3.1...5.0.0)

stefan6419846 and others added 30 commits July 28, 2024 17:16

DEV: Test against Python 3.13 (py-pdf#2776)

4bd54bd

* DEV: Test against Python 3.13
* fix typo
* add missing setup-python
* fix another typo
* update Pillow version
* attempt to update coverage package
* update number of expected coverage files

STY: Remove boolean value comparison (py-pdf#2779)

d4df20d

PEP 8 recommendation.
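For context, PEP 8 advises against comparing boolean values to `True` or `False` with `==`. A minimal sketch of the pattern, using a hypothetical function that is not taken from pypdf:

```python
def describe_mode(strict: bool) -> str:
    """Hypothetical example of a truthiness test instead of `== True`."""
    # Discouraged by PEP 8:  if strict == True:
    # Preferred:             if strict:
    if strict:
        return "strict"
    return "lenient"
```

The two forms are not always interchangeable for non-boolean values (for example, `2 == True` is false while `bool(2)` is true), so such a cleanup is safest where the value is known to be a `bool`.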

ROB: Handle images with empty data when processing an image from bytes (py-pdf#2786)

3ad9234

Closes py-pdf#2783.

SEC: Fix GitHub workflow vulnerable to script injection (py-pdf#2787)

582557e

Signed-off-by: Diogo Teles Sant'Anna <[email protected]>

MAINT: Remove unused paeth_predictor (py-pdf#2773)

38f3925

MAINT: Remove unused AnnotationFlag

09f9b7e

STY: Refactor b_ (py-pdf#2772)

5abd590

MAINT: Drop Python 3.7 support (py-pdf#2793)

219eb13

MAINT: Remove b_ and str_ (py-pdf#2792)

46c89dd

Closes py-pdf#2726. Closes py-pdf#2791.

MAINT: Improve test coverage (py-pdf#2796)

a9758ae

ENH: Compress PDF files merging identical objects (py-pdf#2795)

cf7fcfd

Add compress_identical_objects(). Discovered in py-pdf#2728. Closes py-pdf#2794. Closes py-pdf#2768.

ROB: Fix extract_text() issues on damaged PDFs (py-pdf#2760)

2eb565d

Closes py-pdf#2702.

ENH: Report PdfReadError instead of RecursionError (py-pdf#2800)

d9a8c54

Closes py-pdf#2761.

BUG: Fix sheared image (py-pdf#2801)

799630d

Closes py-pdf#2411.

MAINT: Fix mypy type output (py-pdf#2799)

454a62a

Closes py-pdf#2798.

ENH: Accept utf strings for metadata (py-pdf#2802)

0c81f3c

Closes py-pdf#2754.

MAINT: Remove unused code (py-pdf#2805)

d2d520b

ROB: Raise PdfReadError when missing /Root in trailer (py-pdf#2808)

9f08cd0

Fixes py-pdf#2806.

MAINT: Improve wording of set_data error message (py-pdf#2810)

b7b3c8c

ENH: Robustify on missing font for Tf operator in text_extract() (py-pdf#2816)

f55d332

Closes py-pdf#2815.

ENH: Add UniGB-UTF16 encodings (py-pdf#2819)

38ea8c5

Closes py-pdf#2812.

ROB: Robustify .set_data() (py-pdf#2821)

82eac7e

Cope with objects where the filter is ["/FlateDecode"] and/or where data has not been read yet.

DEV: Fix coverage uploads (py-pdf#2832)

e694d55

* DEV: Fix coverage uploads

  Starting 2024-09-02, hidden files are ignored by default: https://redirect.github.com/actions/upload-artifact/issues/602
* list files
* no need to list files

DOC: Small changes to PaperSize notes (py-pdf#2834)

b85c171

Plus one typo in xmp.py.

ENH: Add incremental capability to PdfWriter (py-pdf#2811)

98d4425

Closes py-pdf#2780.

ENH: Robustify parsing for Object streams in XRef rebuilding (py-pdf#2818)

9d54f63

Closes py-pdf#2817.

STY: Use f-string = functionality (py-pdf#2835)

c4e95bd

* STY: Use f-string = functionality
* STY: Use f-string = functionality
* STY: Use f-string = functionality

  Also switch the order of a tuple to match the order of the line above.

---------

Co-authored-by: pubpub-zz <[email protected]>
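The "f-string = functionality" mentioned above is the self-documenting `=` specifier available since Python 3.8, which became usable once Python 3.7 support was dropped. A small sketch with made-up variable names (not taken from the pypdf sources):

```python
page_number = 3
label = "cover"

# Before: the variable name is spelled out by hand next to its value.
old_style = f"page_number={page_number}, label={label!r}"

# After: `=` emits the expression text followed by the repr of its value.
new_style = f"{page_number=}, {label=}"
```

Both strings come out identical here because `=` uses `repr()` by default, which is why the hand-written version needs `!r` to match.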

BUG: Warn when visitor* arguments are ignored (py-pdf#2845)

78baa8f

visitor* function arguments are silently ignored when extraction_mode="layout". Document this a bit better and add a warning when these arguments are ignored. Closes py-pdf#2840.
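The behaviour described above could look roughly like the sketch below. It is an illustration only: `extract_text_sketch` is a hypothetical stand-in, and the actual pypdf change may report the warning through a different mechanism than the standard `warnings` module.

```python
import warnings
from typing import Callable, Optional


def extract_text_sketch(
    extraction_mode: str = "plain",
    visitor_text: Optional[Callable[..., None]] = None,
) -> str:
    """Hypothetical stand-in for a text extraction entry point."""
    if extraction_mode == "layout" and visitor_text is not None:
        # Tell the caller instead of silently dropping the argument.
        warnings.warn(
            "visitor_text is ignored when extraction_mode='layout'",
            UserWarning,
            stacklevel=2,
        )
    return ""  # The extraction itself is out of scope for this sketch.
```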

ENH: Add capability to remove /Info from PDF (py-pdf#2820)

a790532

pubpub-zz and others added 16 commits September 14, 2024 14:20

MAINT: Deprecate PdfMerger, AnnotationBuilder and other deprecations cleanup (py-pdf#2813)

1bbc301

MAINT: Simplify test with None and NullObject (py-pdf#2829)

8ebd311

STY: Minor code-style improvements for _reader.py

ac2983b

Merge branch 'main' into reader-minor-sty

1f0861f

Fix tests

dfa3d5c

Update pypdf/_reader.py

6253b4b

Co-authored-by: pubpub-zz <[email protected]>

fix doc building warning

bc3ae82

Undo is_null_or_none

7a4409f

Undo

dd68fa1

Undo

7510d54

Merge branch 'main' into reader-minor-sty

e21eff3

Merge branch 'main' into reader-minor-sty

27df17e

DOC: Tiny changes (py-pdf#2844)

c00ec60

Merge branch 'main' into reader-minor-sty

847ae54
