huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 27.6k
Star 138k

Code
Issues 981
Pull requests 537
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: huggingface/transformers

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

981 Open 15,531 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

tokenizer_class: LlamaTokenizerFast becomes LlamaTokenizer after load + immediate save bug

#35832 opened Jan 22, 2025 by Qubitium

2 of 4 tasks

ImportError: cannot import name 'NoneType' from 'types' on main bug

#35827 opened Jan 22, 2025 by harupy

1 of 4 tasks

model.gradient_checkpointing_enable() makes loss.requires_grad be False bug

#35826 opened Jan 22, 2025 by ZCWei51

2 of 4 tasks

multi-gpu: test_model_parallel_beam_search tests fail with "IndexError: list index out of range"

#35824 opened Jan 21, 2025 by dvrogozh

convert_llama_weight_to_hf.py bug

#35820 opened Jan 21, 2025 by AyushSingh096

1 of 4 tasks

[Feature Request] Support register customize quantization method out-of-tree Feature request

Request for a new feature

#35814 opened Jan 21, 2025 by ice-tong

How to change data

#35807 opened Jan 21, 2025 by kim90000

RWKV CUDA error: an illegal memory access was encountered during training from scratch

#35805 opened Jan 21, 2025 by npkanaka

Significant Increase in Training Loss after Upgrading from Transformers 4.47.1 to 4.48.0 bug

#35787 opened Jan 20, 2025 by mjkmain

2 of 4 tasks

Ascend：Training not loaded into NPU bug

#35785 opened Jan 20, 2025 by CurtainRight

2 of 4 tasks

Auto-resume from checkpoint throws error if last checkpoint is incomplete bug

#35782 opened Jan 20, 2025 by SilverSoldier

2 of 4 tasks

LLaVA-OneVision image features and image tokens mismatch bug

#35775 opened Jan 19, 2025 by sheryc

2 of 4 tasks

TPU Initialization Error with Transformers in Kaggle TPU VM v3-8 bug

#35774 opened Jan 19, 2025 by kashifliaqat606

4 tasks

Mamba2 doesn't support Multi-GPU training (fast path) bug

#35770 opened Jan 19, 2025 by NadavSc

2 of 4 tasks

Issue: Error with _eos_token_tensor when using Generator with GenerationMixin bug

#35767 opened Jan 18, 2025 by surenoobster

1 of 4 tasks

Defining LLM Dataset types in Trainers or during Training Workflow Feature request

Request for a new feature

#35766 opened Jan 18, 2025 by mimipynb

Inconsistent output lengths when max_length=20 is set implicitly vs explicitly in generate() bug

#35765 opened Jan 18, 2025 by imantdaunhawer

2 of 4 tasks

multi-gpu: test_model_parallel_beam_search tests fail with "RuntimeError: Expected all tensors to be on the same device"

#35762 opened Jan 18, 2025 by dvrogozh

How can we use CPU offloading when using AutoModelForCausalLM and THUDM/cogvlm2-llama3-chat-19B bug

#35751 opened Jan 17, 2025 by FurkanGozukara

Qwen2VL exhibits significant performance differences under different attention implementations. bug

#35749 opened Jan 17, 2025 by masn1310

2 of 4 tasks

pipeline AttributeError with torch.nn.DataParallel bug

#35747 opened Jan 17, 2025 by kerem-coemert

2 of 4 tasks

Audio-Classification pipeline function_to_apply ignores initialized values (possibly generalizes to other classification pipelines) bug

#35739 opened Jan 16, 2025 by wilke0818

2 of 4 tasks

Significant Performance Gap Between MaskFormer and Mask2Former Despite Identical Training Code bug

#35738 opened Jan 16, 2025 by olmobaldoni

2 of 4 tasks

Audio-Classification Pipeline top_k Documentation mismatch and bug (possibly generalizes to any classification pipelines) bug

#35736 opened Jan 16, 2025 by wilke0818

2 of 4 tasks

TypeError: 'NoneType' object is not iterable

#35719 opened Jan 15, 2025 by 0xD4rky

Previous 1 2 3 4 5 … 39 40 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly