Model saving is not implemented for the mixed quant system, so this breaks model saving for every scaled fp8 model. That needs to be fixed before this gets merged.
Add CLIP-G long CLIP support. Text encoder refactor. Support Llama models with different vocab sizes.