Commit Graph

14 Commits

Author SHA1 Message Date
Yousef Rafat
b84af5b947 small attention fix 2025-11-17 23:03:52 +02:00
Yousef Rafat
3f71760913 resblock fix 2025-11-17 06:50:54 +02:00
Yousef Rafat
61b1efdaf0 vectrozied correct implementation of moe forward 2025-11-16 19:25:37 +02:00
Yousef Rafat
4a5509a4c5 . 2025-11-16 16:20:35 +02:00
Yousef Rafat
d731c58353 improving performance and fixing race condition 2025-11-16 16:19:39 +02:00
Yousef Rafat
12cc6924ac meta init 2025-11-14 20:10:52 +02:00
Yousef Rafat
7b4c1e8031 async cache revamp
Added an async loading and offloading of moe layers, having consistent memory with oom errors.
Used to give oom error after the third layer with 24 giga bytes gpu, now goes to the end with consistent memory with minimal latency
2025-11-14 09:15:16 +02:00
Yousef Rafat
44346c4251 removed all errors 2025-11-08 19:49:02 +02:00
Yousef Rafat
5056a1f4d4 important fixes 2025-11-06 00:24:49 +02:00
Yousef Rafat
9e9c536c8e fixes from testing 2025-11-04 23:55:16 +02:00
Yousef Rafat
ca119c44fb returned kv cache for image generation 2025-11-01 23:06:11 +02:00
Yousef Rafat
10a17dc85d a bunch of fixes 2025-11-01 16:40:49 +02:00
Yousef Rafat
a2fff60d4c vectorized implementation of moe/fixes for issues 2025-10-31 23:53:13 +02:00
Yousef Rafat
de43880bdb Hunyuan Image 3.0 2025-10-31 18:56:20 +02:00