#TNGTech
Recap #tngbtd: Our 18th conference with 1,400 guests on-site, additional remote participants, 40 talks, and 74 speakers was a huge success! Recordings of most talks will be available soon at www.youtube.com/@tngtech. Thanks to everyone involved in this event. 🚀
October 28, 2025 at 10:24 AM
今日のHuggingFaceトレンド

tngtech/DeepSeek-TNG-R1T2-Chimera
本リポジトリは、DeepSeekの複数の既存モデルを「Assembly of Experts」手法で統合して開発された、新しい大規模言語モデル「DeepSeek-TNG R1T2 Chimera」(671B)を公開することを目的としています。
先行モデルで課題となっていたトークンの一貫性問題を解決し、新たな知能と出力トークン長の最適なバランスを提示しています。
モデルの仕様、構築方法、および詳細な評価結果が共有されています。
tngtech/DeepSeek-TNG-R1T2-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
July 10, 2025 at 10:19 AM
DeepSeek-R1T-Chimera hits R1-level reasoning, runs much faster, and uses ~40% fewer output tokens. It's built by merging routed experts from R1 and V3. huggingface.co/tngtech/Dee...
tngtech/DeepSeek-R1T-Chimera · Hugging Face
huggingface.co
April 28, 2025 at 1:28 PM
DeepSeek-TNG-R1T2-Chimera Article URL: https://huggingface.co/tngtech/DeepSeek-TNG-R1T2-Chimera Comments URL: https://news.ycombinator.com/item?id=44449540 Points: 3 # Comments: 0

Origin | Interest | Match
tngtech/DeepSeek-TNG-R1T2-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
July 2, 2025 at 10:59 PM
Serving #LLMs for over 50 applications, thereby consuming more than 100M tokens while generating over 10M tokens/day, requires us to carefully tune our request processing. Have a look at our article on @hf.co 👉 huggingface.co/blog/tngtech...
June 12, 2025 at 8:56 AM
今日のHuggingFaceトレンド

tngtech/DeepSeek-TNG-R1T2-Chimera
このリポジトリは、複数の既存のDeepSeekモデルを統合・改良した新しいテキスト生成向け大規模言語モデル「DeepSeek-TNG-R1T2-Chimera」を公開するものです。
以前のモデルの課題を解決し、知能と出力効率を向上させることを目的としています。
tngtech/DeepSeek-TNG-R1T2-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
July 8, 2025 at 10:19 AM
There are a few proposed solutions:

- TNG Tech's DeepSeek-R1T-Chimera ( huggingface.co/tngtech/Deep... )
- Moonshot AI's long2short methods as documented in Kimi k1.5: Scaling Reinforcement Learning with LLMs ( arxiv.org/abs/2501.12599 )
April 29, 2025 at 6:14 PM
今日のHuggingFaceトレンド

tngtech/DeepSeek-TNG-R1T2-Chimera
このリポジトリは、DeepSeekの複数の親モデル(R1-0528、R1、V3-0324)を「Assembly of Experts」手法で統合して構築された、新しい大規模言語モデル「DeepSeek-TNG R1T2 Chimera」(671B)を公開することを目的としています。
このモデルは、以前のモデルの課題を解決し、知能と出力トークン長のバランスを改善した最先端の言語生成能力を提供します。
tngtech/DeepSeek-TNG-R1T2-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
July 6, 2025 at 10:18 AM
if you want to be excited about something, check out

huggingface.co/tngtech/Deep...

they merged Deepseek R1 and v3 and found that it got much smarter than v3 alone, but without COT - and according to some people that have tried it, it's vibes check out

It might be on some API providers soon
tngtech/DeepSeek-R1T-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
April 28, 2025 at 6:13 AM
Serving #LLMs to multiple applications and users is challenging due to limited GPU resources. Our colleague Benjamin Merkel explores common issues in request queueing and potential solutions such as fair scheduling based on metrics in his article: huggingface.co/blog/tngtech...
April 4, 2025 at 10:45 AM
DeepSeek-TNG-R1T2-Chimera Article URL: https://huggingface.co/tngtech/DeepSeek-TNG-R1T2-Chimera Comments URL: https://news.ycombinator.com/item?id=44449540 Points: 3 # Comments: 0

Interest | Match | Feed
Origin
huggingface.co
July 2, 2025 at 10:58 PM
Thomas Endres und das @tngtech Team nehmen euch bei der #KINAVIGATOR mit auf eine Reise durch die Welt der KI: Neuronale Netze zur Erzeugung von Kunst, Deepfakes in Echtzeit, ein NLP-Chatbot und eine KI, die Social-Media-Kommentare generiert. https://scomp.ly/dBaxle8
November 27, 2024 at 7:22 PM
R1 Chimera: a model merge of the routed experts of DeepSeek R1 and V3

The resulting merged model performs as well as R1 but without the wandering thought traces. Just as smart, but faster.

huggingface.co/tngtech/Deep...
April 27, 2025 at 11:29 AM
The @tngtech #bigtechday was amazing. Met a couple of old and new faces.

The talk of the #schwarzdigits group was pretty cool. They sound pretty mature.

Yet I noticed that I'm not so good in getting in touch with new people on such a big event. 😕
October 24, 2025 at 6:32 PM
At TNG, we handle 5,000+ #LLM requests per hour and generate 10+ million tokens every day. Learn how our team optimizes inference serving for low-latency responses in high-traffic environments in the second article of our series on LLM performance: huggingface.co/blog/tngtech...
April 16, 2025 at 10:19 AM
TNG Tech created a LLM-Chimera by merging DeepSeek-R1 and DeepSeek-V3, combining the reasoning capabilities of R1 with the token efficiency improvements of V3.

In benchmarks, it appears to be as smart as R1 but much faster, using 40% fewer output tokens.

huggingface.co/tngtech/Deep...
tngtech/DeepSeek-R1T-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
April 29, 2025 at 2:56 PM
siga este tutorial

www.reddit.com/u/imowlekk/s...

são 200 msg grátis por dia

os modelos do deepseek sao

deepseek-ai/DeepSeek-R1

deepseek-ai/DeepSeek-R1-0528

tngtech/DeepSeek-R1T-Chimera

microsoft/MAI-DS-R1-FP8

deepseek-ai/DeepSeek-V3-0324
deepseek-ai/DeepSeek-V3
deepseek-ai/DeepSeek-V3-Base
July 1, 2025 at 6:27 PM
DeepSeek-TNG-R1T2-Chimera
L: https://huggingface.co/tngtech/DeepSeek-TNG-R1T2-Chimera
C: https://news.ycombinator.com/item?id=44449540
posted on 2025.07.02 at 18:32:17 (c=0, p=4)
July 2, 2025 at 11:31 PM
We recently fine-tuned an Optical Character Recognition AI model based on #olmOCR to help us automate our document processing workflows. In an article on Hugging Face, we discuss how we trained the Vision Language Model and also share the model weights. #OCR #VLM

huggingface.co/blog/tngtech...
Finetuning olmOCR to be a faithful OCR-Engine
A Blog post by TNG Technology Consulting GmbH on Hugging Face
huggingface.co
April 23, 2025 at 10:16 AM
It is not a finetune or distillation, but constructed from neural network parts of both parent MoE models.

huggingface.co/tngtech/Deep...
tngtech/DeepSeek-R1T-Chimera · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
April 27, 2025 at 7:47 PM