Tether releases QVAC Genesis I, a synthetic dataset for training AI models, and launches QVAC Workbench, an AI application.
QVAC, the AI research arm of Tether Data, has launched the QVAC Genesis initiative and released the synthetic dataset Genesis I. This dataset, containing 41 billion text tokens, will help build smarter and more accurate STEM language models globally. These trained models can learn words and their associated logic. Rigorously validated against educational and scientific benchmarks, it demonstrates superior reasoning and problem-solving capabilities in subjects like mathematics and physics. This is the first publicly available synthetic dataset purpose-built and rigorously validated for educational content, addressing the shortage of publicly available training datasets in key STEM fields. QVAC Genesis I aims to empower AI training in the hands of the public through open, high-quality data.