Issues: Lightning-AI/pytorch-lightning
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Dynamically link arguments in Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
LightningCLI
?
feature
#19858
opened May 9, 2024 by
EthanMarx
Is the Lightning App deprecated? (Lightning App doc is not found)
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#19854
opened May 8, 2024 by
guyleaf
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [68]] is at version 3; expected version 2 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#19853
opened May 8, 2024 by
ASAmbitious
When doing tuner.scale_batch_size, check full dataset length first
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#19850
opened May 6, 2024 by
fingoldo
ckpt_path
in Trainer
accepts URIs to automatically load checkpoints from remote paths
feature
#19849
opened May 5, 2024 by
aretor
Exception in RecordFunction callback: state_ptr INTERNAL ASSERT FAILED at "../torch/csrc/profiler/standalone/nvtx_observer.cpp":115
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19848
opened May 5, 2024 by
nhkhoi91
trainer.fit from checkpoint without performance improvement will break 'last' link to checkpoint on window11
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19845
opened May 4, 2024 by
workhours
Unable to extract confusion matrix as a metric from trainer
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19835
opened May 1, 2024 by
lathashree01
Loading large models with fabric, FSDP and empty_init=True does not work
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19833
opened May 1, 2024 by
RuABraun
WandbLogger Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.1.x
save_dir
and dir
parameters do not work as expected.
bug
#19830
opened Apr 30, 2024 by
Jigar1201
How to incorporate vLLM in Lightning for LLM inference?
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#19829
opened Apr 30, 2024 by
YuWang916
TensorBoardLogger has the wrong epoch numbers much more than the fact
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.1.x
#19828
opened Apr 30, 2024 by
AlbireoBai
OnExceptionCheckpoint: training resumes if ckpt found, even if no ckpt_path provided
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19827
opened Apr 29, 2024 by
brijow
AWS Trainium fails number of device validation when using more than 1 accelerator on the instances
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.0.x
ver: 2.1.x
#19826
opened Apr 29, 2024 by
BrianF-tessera
Add a warning when some of the modules are in eval mode before the training stage
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#19820
opened Apr 26, 2024 by
mszulc913
Full validation after first microbatch when training after LearningRateFinder
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#19818
opened Apr 25, 2024 by
clumsy
Multi-node Training with DDP stuck at "Initialize distributed..." on SLURM cluster
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19817
opened Apr 25, 2024 by
OswaldHe
Checkpoint every_n_steps reruns epoch on restore
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#19815
opened Apr 25, 2024 by
heth27
Existing metric keys not moved to device after LearningRateFinder
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#19813
opened Apr 25, 2024 by
clumsy
Issue in Manual optimisation, during self.manual_backward call
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.0.x
#19810
opened Apr 25, 2024 by
pranavrao-qure
Differentiate testing multiple sets/models when logging
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#19809
opened Apr 25, 2024 by
leleogere
Current FSDPPrecision does not support custom scaler for 16-mixed precision
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#19803
opened Apr 23, 2024 by
SongzhouYang
FSDP Strategy checkpoint loading
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#19802
opened Apr 23, 2024 by
xin-w8023
Construct objects from yaml by classmethod
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#19801
opened Apr 22, 2024 by
Boltzmachine
parsing issue with Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
save_last
parameter of ModelCheckpoint
bug
#19799
opened Apr 22, 2024 by
mariovas3
Previous Next
ProTip!
Follow long discussions with comments:>50.