limiteinductive
|
f541badcb3
|
Allow optional train ModelConfig + forbid extra input for configs
|
2024-02-10 16:13:10 +01:00 |
|
Pierre Colle
|
25bfa78907
|
lr, betas, eps, weight_decay at model level
Co-authored-by: Cédric Deltheil <355031+deltheil@users.noreply.github.com>
|
2024-02-09 12:05:13 +01:00 |
|
Cédric Deltheil
|
9aefc9896c
|
test_trainer: use model_copy instead of copy
The `copy` method has been deprecated.
|
2024-02-08 20:07:34 +01:00 |
|
Colle
|
f4aa0271b8
|
less than 1 epoch training duration
|
2024-02-08 19:20:31 +01:00 |
|
limiteinductive
|
41508e0865
|
change param name of abstract get_item method
|
2024-02-08 18:52:52 +01:00 |
|
limiteinductive
|
2e526d35d1
|
Make Dataset part of the trainer
|
2024-02-07 16:13:01 +01:00 |
|
limiteinductive
|
2ef4982e04
|
remove wandb from base config
|
2024-02-07 11:06:59 +01:00 |
|
limiteinductive
|
ea05f3d327
|
make device and dtype work in Trainer class
|
2024-02-06 23:10:10 +01:00 |
|
Pierre Chapuis
|
7eb8eb4c68
|
add support for pytorch 2.2 (2.1 is still supported)
also bump all dev dependencies to their latest version
|
2024-01-31 15:03:06 +01:00 |
|
limiteinductive
|
0ee2d5e075
|
Fix warmup steps calculation when gradient_accumulation is used
|
2024-01-25 12:20:36 +01:00 |
|
limiteinductive
|
7f722029be
|
add basic unit test for training_utils
|
2024-01-14 22:08:20 +01:00 |
|