Author | Commit | Message | Date
kijai | ee728a795f | Default to fused rms_norm | 2026-04-30 14:30:52 +03:00
kijai | 4257b8f35c | Use embed scale class instead of buffer. Slight difference to HF, but technically more accurate and simpler code | 2026-04-27 19:05:38 +03:00
kijai | 80af032762 | Code cleanup | 2026-04-21 16:44:27 +03:00
kijai | 845eb14425 | Handle thinking tokens different only for Gemma4 | 2026-04-13 23:40:52 +03:00
kijai | e0cccbd4c9 | Fix image encoder attention mask type. So it works with basic attention | 2026-04-13 23:29:27 +03:00
kijai | 387e8d8a4c | small fixes | 2026-04-12 18:27:26 +03:00
kijai | 0fc398a821 | Various fixes | 2026-04-12 18:11:07 +03:00
kijai | 17ccff25be | Cleanup | 2026-04-12 16:53:12 +03:00
kijai | 6b803abe5a | update comment | 2026-04-12 16:30:49 +03:00
kijai | 6718be09ba | cleanup, enable fused rms norm by default | 2026-04-10 15:28:26 +03:00
kijai | 05eaceafa1 | Cleanup, video fixes | 2026-04-07 12:37:29 +03:00
kijai | 93e8635110 | parity with reference implementation. outputs can 100% match transformers with same sdpa flags, checkpoint this and then optimize | 2026-04-07 01:15:04 +03:00
kijai | 832753f497 | initial gemma4 support | 2026-04-03 03:46:45 +03:00