BLOG

Insights, updates, and thoughts on AI, machine learning, and data automation.

Product March 25, 2026

Faust-1: A German-First Language Model That Runs on Your Laptop

Introducing Faust-1, a 1.6B-parameter language model trained from scratch for German. Optimized for local deployment on consumer hardware: no cloud or data-center GPUs required.

Research January 20, 2026

🤐 Do Chatbot LLMs Talk Too Much? Introducing YapBench

We introduce YapBench, a benchmark for measuring how much LLMs over-explain when answering simple questions. Our evaluation of 76 models reveals an order-of-magnitude spread in verbosity, with newer models trending longer.

Product November 20, 2025

EU PII Safeguard: On-Premise PII Detection for GDPR, HIPAA & SOC 2 Compliance

Detect and redact 42 types of personal data across all 24 EU languages with 97% accuracy. It runs on-premise, so no data leaves your infrastructure. GDPR Article 17 compliant, and an alternative to cloud PII APIs.

Research November 15, 2025

Generate Realistic Tabular Data with Language Models

Learn how our open-source GReaT framework uses transformer language models to generate high-quality synthetic tabular data. Published at ICLR 2023, with 140,000+ downloads and adopted on Google's Kaggle platform.
