-
Notifications
You must be signed in to change notification settings - Fork 27.6k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
tokenizer_class:
LlamaTokenizerFast
becomes LlamaTokenizer
after load + immediate save
bug
#35832
opened Jan 22, 2025 by
Qubitium
2 of 4 tasks
ImportError: cannot import name 'NoneType' from 'types' on main
bug
#35827
opened Jan 22, 2025 by
harupy
1 of 4 tasks
model.gradient_checkpointing_enable() makes loss.requires_grad be False
bug
#35826
opened Jan 22, 2025 by
ZCWei51
2 of 4 tasks
multi-gpu: test_model_parallel_beam_search tests fail with "IndexError: list index out of range"
#35824
opened Jan 21, 2025 by
dvrogozh
[Feature Request] Support register customize quantization method out-of-tree
Feature request
Request for a new feature
#35814
opened Jan 21, 2025 by
ice-tong
RWKV CUDA error: an illegal memory access was encountered during training from scratch
#35805
opened Jan 21, 2025 by
npkanaka
Significant Increase in Training Loss after Upgrading from Transformers 4.47.1 to 4.48.0
bug
#35787
opened Jan 20, 2025 by
mjkmain
2 of 4 tasks
Auto-resume from checkpoint throws error if last checkpoint is incomplete
bug
#35782
opened Jan 20, 2025 by
SilverSoldier
2 of 4 tasks
LLaVA-OneVision image features and image tokens mismatch
bug
#35775
opened Jan 19, 2025 by
sheryc
2 of 4 tasks
TPU Initialization Error with Transformers in Kaggle TPU VM v3-8
bug
#35774
opened Jan 19, 2025 by
kashifliaqat606
4 tasks
Mamba2 doesn't support Multi-GPU training (fast path)
bug
#35770
opened Jan 19, 2025 by
NadavSc
2 of 4 tasks
Issue: Error with _eos_token_tensor when using Generator with GenerationMixin
bug
#35767
opened Jan 18, 2025 by
surenoobster
1 of 4 tasks
Defining LLM Dataset types in Trainers or during Training Workflow
Feature request
Request for a new feature
#35766
opened Jan 18, 2025 by
mimipynb
Inconsistent output lengths when
max_length=20
is set implicitly vs explicitly in generate()
bug
#35765
opened Jan 18, 2025 by
imantdaunhawer
2 of 4 tasks
How can we use CPU offloading when using AutoModelForCausalLM and THUDM/cogvlm2-llama3-chat-19B
bug
#35751
opened Jan 17, 2025 by
FurkanGozukara
Qwen2VL exhibits significant performance differences under different attention implementations.
bug
#35749
opened Jan 17, 2025 by
masn1310
2 of 4 tasks
pipeline
AttributeError with torch.nn.DataParallel
bug
#35747
opened Jan 17, 2025 by
kerem-coemert
2 of 4 tasks
Audio-Classification pipeline function_to_apply ignores initialized values (possibly generalizes to other classification pipelines)
bug
#35739
opened Jan 16, 2025 by
wilke0818
2 of 4 tasks
Significant Performance Gap Between MaskFormer and Mask2Former Despite Identical Training Code
bug
#35738
opened Jan 16, 2025 by
olmobaldoni
2 of 4 tasks
Audio-Classification Pipeline top_k Documentation mismatch and bug (possibly generalizes to any classification pipelines)
bug
#35736
opened Jan 16, 2025 by
wilke0818
2 of 4 tasks
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.