Introducing two new tokenizer-free LLM checkpoints from our research lab: TFree-HAT 7B
Built on our Hierarchical Autoregressive Transformer (HAT) architecture, these models achieve top-tier German and English performance while processing text on a UTF-8 byte level.
Aleph Alpha
493 posts
Our mission is a European generalizable AI. We're hiring: jobs.ashbyhq.com/AlephAlpha #AGI, #artificialintelligence, #writtenbyahuman, #writtenbyanAI
- We are excited to launch our two models Pharia-1-LLM-7B-control and Pharia-1-LLM-7B-control-aligned. Both models and the code used to train them are now publicly available and open-sourced for non-commercial research and educational use. Read our model blog post here:
- Today we introduce T-Free, a new paradigm in language processing. Tokenization is one of the core building blocks of large language models (LLMs), transforming natural language into numeric representations for further processing. (1/3) 🔗 lnkd.in/eTi7kjuc
- Today our partners have committed $500M towards our mission to push the frontiers of responsible and human-centric AI technology. (1/4) 🔗 aleph-alpha.com/aleph-alpha-ra… #writtenbyalephalpha
- 🚀 Exciting Announcement from Davos: Aleph Alpha Unveils Tokenizer-Free LLMs! 🚀 We’re thrilled to announce a pioneering innovation that was unveiled yesterday at the World Economic Forum in Davos: Aleph Alpha has introduced a groundbreaking tokenizer-free (T-Free) LLM
- New in Neural Network Parametrization Technique: Introducing Unit-Scaled Maximal Update Parametrization (u-μP). In partnership with @GCResearchTeam, u-μP merges μP and Unit Scaling to boost training stability & hyperparameter transfer across model sizes. Read more about the
- We present to you...LUMINOUS! Aleph Alpha's multilingual, multicapable, generalizable AI language model "Made in Europe" that helps you get the most out of your data. youtu.be/wi73fB19-qI #writtenbyahuman
- We have compared #Luminous to @OpenAI Davinci, @metaai OPT, and @BigscienceW BLOOM, and the results are astonishing. A Thread 1/11 🔗aleph-alpha.com/luminous-perfo… #writtenbyalephalpha
- We're thrilled to announce Lab 1141, a groundbreaking collaboration between Aleph Alpha Research and @TUDarmstadt, with the goal of pushing the boundaries of explainable, safe, and transparent GenAI. 🔗 aleph-alpha.com/aleph-alpha-re… #writtenbyalephalpha
- At #AlephAlpha we are developing Large Language Models (LLMs) that can be trusted in real-world applications. Our most recent work is AtMan, an #ExplainableAI method providing explanations of generative transformer models at almost no extra cost. A Thread 1/9 #writtenbyalephalpha
- For #ICLR2025 we are unveiling a new, high-quality pretraining dataset for German LLMs. Shared to strengthen the open research community. Shaped by our belief in excellence and transparency. huggingface.co/datasets/Aleph…
- Today we also launch PhariaAI, our new end-to-end AI solution for enterprises and governments to build, develop, deploy and distribute sovereign generative AI technology. PhariaAI combines proprietary innovation with our new state-of-the-art open-source models and components