GLM-5.2 with 1M-token context is now practical on a single 8×H200 node.
We quantized @Zai_org's GLM-5.2 to W4AFP8, validated it with @lmsysorg, and kept benchmark quality essentially intact. @bgmshana ❤️
Model weights: huggingface.co/PhalaCloud/GLM…