Meta launches Muse Spark to compete with top AI models

TL;DR

Meta's Muse Spark scores 0.9 on GPQA, matching Claude Opus 4.7, as the company shifts from open-source Llama to proprietary AI under Alexandr Wang.

Nine months after Meta committed $14.3 billion to bring Alexandr Wang and Scale AI's infrastructure expertise in-house, the company released its first tangible product from that bet. Muse Spark, the initial model from Meta Superintelligence Labs, arrived on April 8, deliberately framed not as a flagship rival to GPT-5 but as a fast, compact reasoning system.

The framing is a direct response to what went wrong before. Meta's open-source model release in April 2025 failed to generate developer momentum, according to CNBC, prompting Zuckerberg to rebuild the company's AI development approach entirely. Wang's unit reportedly reconstructed Meta's AI stack from scratch in under a year, a pace the company described as unprecedented in its own development history.

Muse Spark, which carried the internal code name Avocado, targets reasoning tasks in science, mathematics, and health. On GPQA, a graduate-level science benchmark tracked by LLM Stats, it scores 0.9, matching Anthropic's Claude Opus 4.7 and Claude Mythos Preview, both released within the same two-week window. Meta is shipping Muse Spark as a proprietary model, a notable departure from the Llama lineage that defined the company's prior artificial intelligence identity.

The efficiency play

Rather than competing at the top end, Meta is positioning Muse Spark as the foundation of a series. The company says a next-generation model is already in development, and the pitch centers on being "small and fast by design" while delivering benchmark-competitive reasoning. That combination has practical appeal: inference costs remain a real constraint for teams deploying AI at production scale.

Shares rose 6.5% on the day of the announcement, though the move overlapped with broader market gains tied to geopolitical developments. Attributing the stock reaction specifically to Muse Spark requires more separation than the data provides.

The competitive environment is unforgiving. OpenAI and Anthropic are now collectively valued above $1 trillion, as CNBC reported, and both companies have accelerated their release pace. Anthropic shipped Opus 4.7 on April 16 and followed with Claude Design the next day, a design prototyping tool that reads codebases and design files to generate brand-consistent assets, as Unite.ai reported. The April artificial intelligence release calendar has become genuinely difficult to parse.

A week in context

For practitioners trying to situate Muse Spark: GPQA parity with Opus 4.7 is notable on paper, but that benchmark tests narrow graduate-level scientific reasoning. It does not capture coding performance, instruction following, or long-context handling. Meta has not published parameter counts, latency numbers, or API access terms as of this writing, which makes the "small and fast" claim difficult to act on.

Wang's background at Scale AI centered on data labeling infrastructure and model evaluation pipelines, not frontier research. That expertise could translate into advantages in training data quality and evaluation rigor, areas where inter-lab gaps are harder to observe but may matter more over time than raw benchmark scores. Whether it does is still an open question. Muse Spark is the first public evidence.

The proprietary release also invites scrutiny. Part of Llama's appeal to developers was fine-tuning flexibility and local deployment. A closed-weights API model competes on a different axis, one where Meta has far less established credibility than its open-source history might suggest.

Meta has reorganized its core artificial intelligence research division around the goal of closing the gap with OpenAI and Google. Muse Spark is a foundation, by the company's own description. Whether the next model in the series arrives before the current leaders extend their lead is the question that actually determines whether this multi-billion dollar repositioning holds.

FAQ

What is Meta Muse Spark?
Muse Spark is the first model from Meta Superintelligence Labs, released on April 8, 2026. It is a compact, proprietary reasoning model built to handle tasks in science, math, and health, and is framed as the opening entry in the Muse series rather than a direct frontier competitor.

How does Muse Spark compare to Claude or GPT?
On GPQA it matches Anthropic's Claude Opus 4.7 with a score of 0.9. Meta has not released detailed evaluations for coding or instruction-following, so a complete head-to-head comparison is not yet possible based on available data.

Is Muse Spark open source?
No. Unlike Meta's Llama models, Muse Spark is proprietary. Meta has not announced open-weight access or fine-tuning options for external developers as of this writing.

Who is Alexandr Wang and what is his role at Meta?
Wang was CEO of Scale AI before Meta made a $14.3 billion investment in the company in mid-2025. He now leads Meta Superintelligence Labs, overseeing the development of the Muse model series.

About the Author

Guilherme A.

Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.

Connect on LinkedIn