LLM Distillation Multi-Level Tutorial

A Coding Implementation to Build a Hierarchical Planner AI Agent Using Open-Source LLMs with Tool Execution and Structured Multi-Agent Reasoning

In this tutorial, we build a hierarchical planner agent using an open-source instruct model. We design a structured multi-agent architecture comprising a planner agent, an executor agent, and an ...

VentureBeat

Microsoft's new AI training method eliminates bloated system prompts without sacrificing model performance

In building LLM applications, enterprises often have to create very long system prompts to adjust the model’s behavior for their applications. These prompts contain company knowledge, preferences, and ...

InfoWorld

Multi-token prediction technique triples LLM inference speed without auxiliary draft models

With reported 3x speed gains and limited degradation in output quality, the method targets one of the biggest pain points in production AI systems: latency at scale. High inference latency and ...

InfoWorld

Anthropic alleges large-scale distillation campaigns targeting Claude

The AI company claims DeepSeek, Moonshot, and MiniMax used fraudulent accounts and proxy services to extract Claude’s capabilities at scale, even as experts point out that the industry itself relies ...

Infosecurity-magazine.com

Chinese AI Firms Hit Claude with Distillation Attacks, Anthropic Warns

Generative AI firm Anthropic said three Chinese AI companies have generated millions of queries with the Claude large language model (LLM) in order to copy the model – a technique called ‘model ...

Mashable

Anthropic: Chinese AI firms created 24,000 fraudulent accounts for 'distillation attacks'

Anthropic is accusing three Chinese artificial intelligence companies of "industrial-scale campaigns" to "illicitly extract" its technology using distillation attacks. Anthropic says these companies ...

Ars Technica

Attackers prompted Gemini over 100,000 times while trying to clone it, Google says

On Thursday, Google announced that “commercially motivated” actors have attempted to clone knowledge from its Gemini AI chatbot by simply prompting it. One adversarial session reportedly prompted the ...

Bleeping Computer

Google says hackers are abusing Gemini AI for all attacks stages

State-backed hackers are using Google's Gemini AI model to support all stages of an attack, from reconnaissance to post-compromise actions. Bad actors from China (APT31, Temp.HEX), Iran (APT42), North ...

VentureBeat

MIT's new fine-tuning method lets LLMs learn new skills without losing old ones

When enterprises fine-tune LLMs for new tasks, they risk breaking everything the models already know. This forces companies to maintain separate models for every skill. Researchers at MIT, the ...

marktechpost

How to Build Multi-Layered LLM Safety Filters to Defend Against Adaptive, Paraphrased, and Adversarial Prompt Attacks

In this tutorial, we build a robust, multi-layered safety filter designed to defend large language models against adaptive and paraphrased attacks. We combine semantic similarity analysis, rule-based ...

Search Engine Land

What 2 million LLM sessions reveal about AI discovery

We analyzed nearly two million LLM sessions across nine industries from January through December 2025. We started with a simple assumption: ChatGPT dominates, usage patterns are uniform, and the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results