<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
    <channel>
        <title>VentureBeat</title>
        <link>https://venturebeat.com/feed/</link>
        <description>Transformative tech coverage that matters</description>
        <lastBuildDate>Mon, 15 Jun 2026 19:54:04 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>en</language>
        <copyright>Copyright 2026, VentureBeat</copyright>
        <item>
            <title><![CDATA[Satya Nadella warns that AI could hollow out entire industries, echoing the damage done by globalization]]></title>
            <link>https://venturebeat.com/technology/satya-nadella-warns-that-ai-could-hollow-out-entire-industries-echoing-the-damage-done-by-globalization</link>
            <guid isPermaLink="false">4bprhv8nlBJ8rUGE29B4Wf</guid>
            <pubDate>Mon, 15 Jun 2026 19:49:14 GMT</pubDate>
            <description><![CDATA[<p>Microsoft CEO Satya Nadella <a href="https://x.com/satyanadella/status/2066182223213293753">published a sweeping essay</a> on Sunday laying out what he describes as the defining economic challenge of the AI era: the risk that a handful of frontier models will absorb the expertise of entire industries and commoditize it, leaving businesses stripped of their competitive moats.</p><p>&quot;The last thing any of us want is a world where every company across every sector is ceding value to a few models that eat everything they see,&quot; Nadella wrote in the piece, titled &quot;A frontier without an ecosystem is not stable,&quot; which he posted on X. &quot;If all the value is accrued by only a few models, the political economy will simply not tolerate it. There is no societal permission for an AI future that hollows out entire industries.&quot;</p><p>The essay is unusually philosophical for a sitting CEO of a $3 trillion technology company. But it arrives at a moment when the theoretical risks Nadella describes are becoming tangible — and, critically, when Microsoft itself is grappling with the very dynamics he warns about.</p><h2>Nadella introduces &quot;token capital&quot; as the new currency of enterprise AI strategy</h2><p>At the center of Nadella&#x27;s essay sits a conceptual framework built on two pillars he calls &quot;<a href="https://x.com/satyanadella/status/2066182223213293753">human capital</a>&quot; and &quot;<a href="https://x.com/satyanadella/status/2066182223213293753">token capital</a>.&quot; Human capital, he writes, &quot;comprises the knowledge, judgment, relationships, ingenuity, and pattern recognition of its people,&quot; while token capital refers to &quot;the firm&#x27;s AI capability it builds and owns.&quot;</p><p>The two are not in tension, he insists. &quot;Importantly, human capital does not become less valuable as token capital grows. It only becomes more valuable!&quot; he writes. &quot;I believe human agency will be the driver of token capital growth. Humans will set ambitious goals, connect dots across domains, build relationships, and recognize patterns that matter most. Without human direction, you have compute running in circles.&quot;</p><p>This framing is a deliberate counterweight to the narrative that <a href="https://hub.jhu.edu/2026/02/23/will-ai-make-human-workers-obsolete/">AI will simply replace human workers</a> or, at the enterprise level, dissolve the intellectual property that differentiates one company from another. Nadella is arguing that the real danger is not AI&#x27;s capability but its tendency to centralize — and that the solution requires a fundamentally new architecture for how businesses interact with the technology.</p><p>He describes the real opportunity as &quot;not in picking the best model but instead in building a learning loop on top of models where human capital and token capital compound.&quot; The key test of a company&#x27;s sovereignty in this new era, he writes, is whether it can &quot;switch out a &#x27;generalist&#x27; model without losing the &#x27;company veteran&#x27; expertise built into their learning system.&quot;</p><p>This is the essay&#x27;s most actionable claim — and its most provocative. Nadella is telling enterprises they need to decouple their institutional intelligence from whatever frontier model they happen to be running, creating portable knowledge systems that survive vendor changes.</p><h2>Why Nadella is comparing AI concentration to the outsourcing crisis that gutted industrial economies</h2><p>Nadella draws a pointed historical parallel to make his warning concrete. &quot;Think about what happened in the first phase of globalization where entire industrial economies were hollowed out by outsourcing,&quot; he writes. &quot;The GDP numbers looked fine on the surface, but the displacement was real and the consequences are still being felt. Let us not bring that dynamic into the AI era, with a small number of AI systems capturing all the economic returns, while entire industries find their knowledge commoditized right out from underneath them.&quot;</p><p>The globalization analogy is not accidental. It reframes the AI concentration debate from a narrow technology question into a political-economy argument — one that regulators, policymakers, and voters can grasp. By invoking the social costs of offshoring, Nadella is signaling that the stakes extend well beyond the enterprise technology stack. He is warning that if the AI industry fails to distribute value broadly, the political system will intervene to force the issue.</p><p>&quot;In my view, our priority has to be building a frontier ecosystem, not just a frontier model, so value flows broadly across every company, every industry, and every country,&quot; he writes. He grounds this in an older platform philosophy: &quot;This is the ethos I&#x27;ve grown up with where platforms enable more value on top than is captured inside, and where every company can continuously innovate and build value of its own.&quot; It is a direct echo of the Windows-era argument, updated for the age of inference — and it carries a similarly self-interested subtext, given that Microsoft&#x27;s cloud business sits squarely in that platform layer.</p><h2>Microsoft&#x27;s own runaway AI costs reveal the gap between Nadella&#x27;s vision and operational reality</h2><p>What makes Nadella&#x27;s essay so striking is its timing. He published it on a day when Reuters reported that <a href="https://www.reuters.com/business/microsoft-sued-by-shareholders-over-expenses-cloud-business-ai-2026-06-15/">Microsoft shareholders filed a proposed class-action lawsuit</a> in Seattle federal court, accusing the company of inflating its stock price by failing to disclose slowing growth in its Azure cloud business and the need to spend billions of dollars on AI infrastructure. The suit names Nadella and Chief Financial Officer Amy Hood among the defendants.</p><p>As the <a href="https://finance.yahoo.com/markets/stocks/articles/msft-stock-rises-despite-shareholder-180947071.html">Yahoo Finance report</a> on the lawsuit noted, Microsoft allegedly &quot;aggressively promoted its AI developments, specifically its &#x27;Copilot&#x27; assistant and close financial alliance with ChatGPT creator OpenAI, to artificially boost investor optimism,&quot; while understating infrastructure strain and capital risks. Microsoft also reported <a href="https://www.reuters.com/business/retail-consumer/microsoft-edges-past-cloud-growth-expectations-2026-01-28/">$37.5 billion of capital spending</a> in its second quarter, up nearly 66% from a year earlier and above the $34.3 billion that analysts projected.</p><p>Microsoft&#x27;s internal cost pressures around AI have surfaced in other concrete ways this year. The company is <a href="https://www.theverge.com/tech/930447/microsoft-claude-code-discontinued-notepad">canceling the majority of its internal Claude Code licenses</a> in its Experiences and Devices division, effective June 30, 2026. Monthly usage rates reached 84 to 95% by April 2026, and per-engineer API costs ranged between $500 and $2,000 monthly, according to <a href="https://windowsforum.com/threads/microsoft-cancels-internal-claude-code-licenses-pushes-copilot-cli-by-2026.418482/">Windows Forum</a>. The cancellation came after Microsoft exhausted portions of its annual AI budget due to token-based billing, as <a href="https://fortune.com/2026/05/22/microsoft-ai-cost-problem-tokens-agents/">Fortune</a> had reported in May.</p><p>The Claude Code episode illustrates, at the micro level, the exact dynamic Nadella describes at the macro level. When a company&#x27;s AI usage is metered by the token — the fundamental unit of compute that powers model inference — the more productive the tool becomes, the more expensive it gets. The term &quot;token capital&quot; in Nadella&#x27;s essay carries a double meaning: it refers both to a firm&#x27;s proprietary AI capability and, implicitly, to the actual tokens consumed in running it. Building a learning loop that compounds is aspirational. Paying the bills for that loop is operational reality.</p><h2>Uber, Meta, and Amazon are all hitting the same AI spending wall — and it validates Nadella&#x27;s warning</h2><p>Microsoft is not alone in this bind. <a href="https://finance.yahoo.com/sectors/technology/articles/uber-burned-entire-2026-ai-180347400.html">Uber burned through its entire 2026 AI coding tools budget</a> in just four months after incentivizing employees to adopt the technology through an internal leaderboard ranking teams by total AI tool usage. Uber has since instituted a monthly $1,500 cap per employee per agentic coding tool, according to <a href="https://techcrunch.com/2026/06/02/uber-caps-employee-ai-spending-after-blowing-through-budget-in-four-months/">TechCrunch</a>. At Meta, an employee created a leaderboard called &quot;<a href="https://finance.yahoo.com/sectors/technology/articles/meta-just-killed-dashboard-let-084400197.html">Claudeonomics</a>&quot; to track which workers consumed the most AI tokens. Amazon, meanwhile, has pushed employees to &quot;<a href="https://fortune.com/2026/05/12/amazon-tokenmaxxing-claude-ai-capex-meta-gil-luria/">tokenmaxx</a>&quot; — use as many AI tokens as possible.</p><p>The emerging pattern is clear: enterprises adopted AI coding tools aggressively, saw genuine productivity gains, and then discovered that the consumption-based economics of frontier models created budget crises that traditional software licensing never would have. Bryan Catanzaro, vice president of applied deep learning at Nvidia, captured the tension bluntly in an <a href="https://fortune.com/article/why-is-the-cost-of-ai-higher-than-human-workers-nvidia-executive/">interview with Axios</a>: &quot;For my team, the cost of compute is far beyond the costs of the employees,&quot; he said.</p><p>These cost dynamics land differently in the context of Nadella&#x27;s essay. He prescribes a three-layer architecture — evaluation, reinforcement learning, and retrieval — designed to sit between a company&#x27;s workforce and whatever frontier model it subscribes to. Companies, he argues, need to build &quot;private evals&quot; that &quot;capture whether a model is actually improving against outcomes that matter to the business (not just external benchmarks!),&quot; alongside &quot;private reinforcement learning environments&quot; that &quot;let models grow stronger on real traces from inside the organization&quot; and a knowledge base that &quot;makes institutional memory queryable and use of tokens more efficient.&quot; He calls the resulting system &quot;a hill climbing machine&quot; that, &quot;unlike most assets, it compounds.&quot;</p><h2>Other Big Tech CEOs are echoing Nadella&#x27;s fears about AI models devouring enterprise knowledge</h2><p>Nadella&#x27;s concerns do not exist in isolation. Other technology leaders have been raising similar warnings throughout 2026, though none have offered as prescriptive a response.</p><p>Snowflake CEO Sridhar Ramaswamy warned in a <a href="https://podcasts.apple.com/us/podcast/whos-winning-the-ai-race-softwares-future-with/id1522960417?i=1000749256704">February podcast</a> that the biggest software companies risk being reduced to mere data sources. &quot;The big model makers want to create a world in which all of the data for all of the enterprises is easily available to them,&quot; Ramaswamy said, describing everything else as &quot;a dumb data pipe that feeds into that big brain.&quot; He added that Snowflake needs to operate with a &quot;fear&quot; that enterprises would abandon software-specific AI agents in favor of all-inclusive agents that hoover up data from everywhere.</p><p>Box CEO Aaron Levie struck a similar note in a <a href="https://www.linkedin.com/feed/update/urn:li:activity:7414386514186498048/">January LinkedIn post</a>. AI models can now perform high-level knowledge work across nearly every profession, from law to strategy to scientific research, he argued. &quot;The question that we will have to wrestle with is, in a world where everyone has access to the same expert intelligence, how does a company differentiate?&quot; he wrote.</p><p>The combined effect of these statements is a shared diagnosis from three very different corners of the enterprise technology market: the current trajectory of AI development threatens to collapse competitive differentiation across entire industries. Nadella&#x27;s essay stands apart from the others because it moves beyond diagnosis and proposes a specific architectural remedy. But the prescription is impossible to separate from the prescriber&#x27;s interests.</p><p>Microsoft sits in precisely the platform layer that Nadella&#x27;s framework would make indispensable — the company builds its own frontier models, operates the cloud infrastructure those models run on, and maintains deep partnerships with the leading independent AI labs. A world in which every enterprise builds a proprietary learning loop on top of commodity foundation models is, conveniently, a world in which Microsoft sells the picks and shovels to all of them.</p><h2>Nadella&#x27;s Scout controversy and shareholder lawsuit reveal the tension inside Microsoft&#x27;s own AI strategy</h2><p>The essay also arrives just ten days after Nadella publicly rebuked one of his own executives for outlining a plan to &quot;<a href="https://nypost.com/2026/06/05/business/microsofts-satya-nadella-slams-company-exec-for-outlining-plan-to-make-people-addicted-to-scout-ai-tool/">make people addicted</a>&quot; to a new AI tool called Scout.. Microsoft corporate vice president Omar Shahine had written an internal memo describing a three-phase plan to transform Scout &quot;from addictive app to agentic platform,&quot; with the first phase focused on features that &quot;make people depend on it daily.&quot; Nadella responded on an internal message board: &quot;This is absolutely a non-goal! If anything we are doing the exact opposite. We want to make sure AI empowers and adds real value to human endeavor and broad economic growth!&quot;</p><p>The Scout incident and Sunday&#x27;s essay together suggest Nadella is actively constructing a public philosophy of AI that emphasizes broad value creation over extractive engagement — whether or not every corner of Microsoft has internalized that message. One anonymous Microsoft employee told 404 Media, as the Post reported, that the leaked Scout document was &quot;very troubling,&quot; adding: &quot;It feels like one of those &#x27;saying the quiet part out loud&#x27; moments.&quot;</p><p>For technical decision-makers evaluating Nadella&#x27;s essay, the practical implications are significant. He is arguing that choosing an AI model matters less than building the learning infrastructure around it. He is arguing that the ability to swap models without losing institutional intelligence is the critical test of AI sovereignty. And he is warning that companies that fail to build these systems will find their expertise absorbed and commoditized by the models themselves. &quot;You can offload a task, or even a job, but you can never offload your learning,&quot; Nadella writes. &quot;The future of the firm is the ability to compound that learning across people and AI.&quot;</p><h2>The question Nadella&#x27;s essay cannot answer is whether Microsoft will practice what its CEO preaches</h2><p>Whether Nadella&#x27;s vision materializes depends on a question his essay carefully sidesteps: whether the platform providers who build and host the frontier ecosystem will resist the temptation to capture the value flowing through it. Nadella insists that &quot;platforms enable more value on top than is captured inside.&quot; But Microsoft&#x27;s own trajectory this year — the ballooning capital expenditures, the Claude Code budget crisis, the shareholder lawsuit alleging concealed costs, the internal memo about making users addicted — suggests the economics of restraint are harder than the philosophy of restraint.</p><p>Nadella ends his essay with the claim that broad value distribution &quot;is the stable equilibrium we should build together.&quot; He may be right. Ecosystems have historically outperformed walled gardens over long time horizons. But stable equilibria require every major player to forgo short-term extraction in favor of long-term compounding — and right now, the AI industry is burning through budgets in four months and spending 66% more on infrastructure than analysts expected. The CEO of the world&#x27;s most valuable technology company has written an eloquent argument for why the AI economy needs to work differently. The open question is whether his own company&#x27;s balance sheet will let him prove it.</p><p>
</p>]]></description>
            <author>michael.nunez@venturebeat.com (Michael Nuñez)</author>
            <category>Technology</category>
            <category>Data</category>
            <category>Business</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/331duWoehbByzdXcqAME9g/c14f677c751a2ff2c9ddcbae149bc8e1/Nuneybits_Vector_art_of_golden_tokens_flooding_funnel_d3ce7ef3-0d50-4835-b3dd-de3ccfa6900e.webp?w=300&amp;q=30" length="0" type="image/webp"/>
        </item>
        <item>
            <title><![CDATA[When deep research isn't enough for your business: Sakana AI launches 'ultra deep research' agent for 100+ page reports in 8 hours]]></title>
            <link>https://venturebeat.com/technology/when-deep-research-isnt-enough-for-your-business-sakana-ai-launches-ultra-deep-research-agent-for-100-page-reports-in-8-hours</link>
            <guid isPermaLink="false">5Gw9F5DHLaGOM9s5bKswAC</guid>
            <pubDate>Mon, 15 Jun 2026 19:30:07 GMT</pubDate>
            <description><![CDATA[<p>Tokyo-based AI startup Sakana AI has officially launched its first commercial product, <a href="https://sakana.ai/marlin/">Sakana Marlin</a>. </p><p>Billed as a &quot;<a href="https://sakana.ai/marlin-release/#English">Virtual CSO</a>&quot; (Chief Strategy Officer), Marlin is an autonomous, B2B research agent that deliberately abandons the instantaneous text generation of modern chatbots in favor of deep, long-horizon reasoning. </p><p>What sets Marlin apart from the current ecosystem of AI tools is its temporal scale: instead of returning an answer in seconds, it runs continuous, self-governing reasoning loops for up to eight hours at a time to deliver deeply researched, well cited, 100-page strategy reports and executive slides. The company posted sample reports generated my Marlin on its product website <a href="https://sakana.ai/marlin/">here</a>.</p><p>Available immediately via the company’s website with pricing starting at a pay-as-you-go tier, the platform is designed strictly for enterprise use—specifically targeting corporations, financial institutions, and think tanks. </p><p>The generative AI hype cycle has largely been defined by speed. For the past two years, the industry standard has been the ability to generate a poem, a line of code, or a surface-level summary in mere milliseconds. But the enterprise frontier is rapidly shifting from shallow, rapid generation to deep, methodical reasoning. </p><p>With Marlin, major businesses are no longer asking how fast an AI can answer, but how deeply it can think.</p><h2><b>The Product: A Virtual CSO</b></h2><p>What exactly is a business getting when they deploy Sakana Marlin? The workflow is fundamentally different from typical large language model (LLM) interactions. Rather than engaging in a tedious back-and-forth prompt engineering session, the user simply provides a core research topic. Following a brief initial exchange to sharpen the scope and direction of the investigation, the human steps away entirely.</p><p>For the next several hours, Marlin operates as a self-contained digital strategy team. It formulates its own initial hypotheses, navigates the web to gather data, cross-references sources to verify findings, and maps the causal dynamics within complex business environments. It is effectively searching for the &quot;winning formula&quot; within a sea of noise.</p><p>Think of it less like a search engine and more like a junior strategy consultant locked in a room with a whiteboard and an internet connection. You provide the strategic prompt in the morning, and by the end of the workday, the system delivers a comprehensive, professional-grade portfolio. </p><p>In Marlin&#x27;s case, the final output is not a generic text blob; it is a structured set of strategic options, complete with executive summary slides, appendices, references, and a deeply researched report. </p><p>The company highlighted several real-world use cases to demonstrate Marlin&#x27;s capacity for complex synthesis, including generating detailed resolution scenarios for a theoretical blockade of the Strait of Hormuz, mapping out the fragmented global AI regulation patchwork, and analyzing macroeconomic trends like the return of &quot;bond vigilantes&quot;.</p><p>Sakana says Marlin relies on multiple AI models, but did not provide specific model names or providers. I&#x27;ve reached out on X to find out more and will update when I receive a repsonse.</p><h2><b>The Engine of Long-Horizon Reasoning</b></h2><p>Under the hood, Marlin is the commercial culmination of Sakana AI’s extensive laboratory breakthroughs over the past two years. </p><p>The product is powered by an exploration engine relying on Sakana&#x27;s own prior research breakthrough, <a href="https://sakana.ai/ab-mcts/">Adaptive Branching Monte Carlo Tree Search (AB-MCTS)</a>, and leverages frameworks derived from &quot;The AI Scientist,&quot; an earlier Sakana AI research project featured in the journal <i>Nature</i> that successfully automated the scientific discovery process from ideation to peer review.</p><p>To understand how this works in practice, consider a real-world analogy: modern chess engines. When a computer plays chess, it doesn&#x27;t just look at the board and guess; it plays out thousands of potential future moves, evaluating the strength of each resulting position before committing to an action. </p><p>Marlin’s AB-MCTS engine does something similar for research. </p><h2><b>Inside the Engine: The Mechanics of AB-MCTS</b></h2><p>The chronology of this technology traces back to June 2025, when Sakana AI first introduced the framework to the public alongside the research paper <i>“</i><a href="https://arxiv.org/pdf/2503.04412"><i>Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search</i></a><i>”</i>. </p><p>At that time, to encourage developer experimentation with collective AI intelligence, the company released the underlying algorithm as an open-source software library called <b>TreeQuest</b>, distributed under the permissive <b>Apache 2.0 license</b>. This open-source milestone laid the technical foundation for what would eventually evolve into the proprietary, enterprise-grade Marlin product a year later.</p><p>Traditionally, when developers attempt to extract higher-quality reasoning from large language models, they rely on a brute-force method called &quot;repeated sampling&quot;—essentially running the model dozens of times in parallel and hoping one of the answers is correct. However, repeated sampling operates blindly; it cannot evaluate its own intermediate steps or pivot based on external feedback.</p><p>AB-MCTS replaces this paradigm with a principled, multi-turn approach driven by a Bayesian decision framework. As the AI constructs a strategy report, the system treats the research process as a branching tree of possibilities. At each node of the tree, the algorithm dynamically balances two distinct behaviors based on external feedback signals:</p><ul><li><p><b>Going Wider (Exploration):</b> Spawning entirely new, alternative hypotheses or candidate responses when the current path yields diminishing returns or unresolved contradictions.</p></li><li><p><b>Going Deeper (Exploitation):</b> Methodically refining, auditing, and building upon an existing candidate solution that shows high strategic promise.</p></li></ul><p>What transforms this from a laboratory experiment into a commercial engine is its extension into <b>Multi-LLM AB-MCTS</b>. </p><p>Sakana AI’s architecture introduces a critical third dimension to the search tree: the ability to dynamically choose <i>which</i> model to invoke for a specific sub-task, treating the industry’s leading frontier models as a plug-and-play collective intelligence network.</p><p>According to technical documentation published by the company, the engine can coordinate highly heterogeneous models—allowing an orchestration model to delegate initial ideation to one LLM, while utilizing a reasoning-heavy model to audit, verify, and correct intermediate errors generated earlier in the search tree.</p><p>By scaling up compute at inference time—leveraging the distinct &quot;personalities&quot; and strengths of multiple foundation models over thousands of automated cycles—AB-MCTS provides the mathematical guardrails Marlin requires. It ensures that the resulting 100-page strategy reports are not merely long-winded AI generations, but the highly vetted product of systemic, automated trial-and-error.</p><h2><b>Licensing, Data, and Enterprise Implications</b></h2><p>It is crucial to note that Sakana Marlin is distinctly not a general consumer tool; it is a commercial software-as-a-service (SaaS) offering restricted to corporate entities, organizations, and sole proprietors.</p><p>For enterprises, licensing and data handling terms are often the determining factors in software adoption. Unlike many consumer-grade AI tools that silently harvest user inputs and proprietary data to train future foundational models, Sakana Marlin operates under a strict, enterprise-grade data policy. </p><p>Neither Sakana AI nor its external AI service providers will use customer data or inputs for model training or fine-tuning unless the client provides explicit opt-in consent. </p><p>Even with consent, data is heavily processed to remove personally identifiable information. This closed-loop security is absolutely vital for companies handling sensitive M&amp;A research, unreleased product strategies, or proprietary market analyses.</p><p>The commercial licensing is structured into tiered pricing models that reflect its enterprise nature:</p><ul><li><p><b>Pay-as-you-go:</b> Users can purchase credits on demand, with a single run costing 100 credits, and add-on credits priced at ¥98 ($0.61 USD) each.</p></li><li><p><b>Pro Plan:</b> At ¥150,000 ($935.68 USD) per month, businesses receive 2,000 credits, bringing down the cost of add-on credits to ¥90 ($0.56 USD).</p></li><li><p><b>Team Plan:</b> Geared toward larger departments, this ¥400,000 ($2,495.14 USD) per month tier includes 6,000 credits, lowering add-on costs to ¥85 ($0.53 USD) per credit.</p></li><li><p><b>Enterprise:</b> Fully custom quotes with dedicated support and customized credit allocations.</p></li></ul><h2><b>Why Sakana Is Worth Watching</b></h2><p>Sakana AI’s transition into a commercial enterprise powerhouse is rooted in the pedigree of its founders, who famously helped spark the current generative AI boom. </p><p><a href="https://venturebeat.com/ai/what-you-need-to-know-about-sakana-ai-the-new-startup-from-a-transformer-paper-co-author">Formed in Tokyo in 2023</a>, the startup was co-founded by Llion Jones—a co-author of Google’s seminal 2017 “Attention Is All You Need” paper who coined the term “transformer”—and David Ha, a former Google Brain researcher and head of research at Stability AI. </p><p>The decision to build a new laboratory outside the Silicon Valley bubble was a deliberate rejection of the current AI ecosystem. At a TED AI conference in late 2025, <a href="https://venturebeat.com/technology/sakana-ais-cto-says-hes-absolutely-sick-of-transformers-the-tech-that-powers">Jones candidly expressed that he was &quot;absolutely sick&quot; of transformers</a>, warning that the intense pressure from investors and the hyper-fixation on scaling single, monolithic models had calcified the industry&#x27;s creativity and blinded researchers to the next major breakthrough.</p><p>To break free from this &quot;big company-itis,&quot; Jones and Ha structured Sakana AI around principles of biomimicry and evolutionary computing. </p><p>The company&#x27;s name, derived from the Japanese word for fish, reflects its core technical philosophy: leveraging collective intelligence similar to schools of fish, ant colonies, or insect swarms. Rather than attempting to build one massive, do-it-all foundation model, Sakana’s research has consistently focused on deploying networks of smaller, specialized models that collaborate dynamically to adapt to complex environments. </p><p>This philosophy posits that by treating individual AI models as members of a &quot;dream team&quot; with complementary strengths, systems can achieve more robust and cost-effective reasoning than relying on sheer scale alone.</p><p>This nature-inspired approach quickly yielded dividends in rigorous, competitive testing. Sakana AI has made significant strides in &quot;inference-time scaling&quot;—allocating computational resources during the problem-solving phase to allow models to think, iterate, and refine their own answers over extended periods. </p><p>In early 2026, the company’s<a href="https://sakana.ai/ahc058/"> ALE-Agent took first place in the highly complex AtCoder Heuristic Contest (AHC058),</a> a combinatorial optimization challenge, outperforming over 800 top-tier human programmers by autonomously rebuilding and testing hundreds of solutions over a four-hour window. </p><p>Similarly,<a href="https://venturebeat.com/orchestration/how-sakana-trained-a-7b-model-to-orchestrate-gpt-5-claude-sonnet-4-and-gemini-2-5-pro"> Sakana introduced &quot;RL Conductor,&quot;</a> a small 7-billion-parameter model trained via reinforcement learning specifically to orchestrate and delegate tasks among a diverse pool of worker models—ranging from GPT-5 to Claude Sonnet 4—achieving state-of-the-art results on reasoning benchmarks at a fraction of traditional computing costs.</p><p>Sakana&#x27;s rapid evolution from a disruptive research lab to a commercial software provider has attracted intense attention from global financial heavyweights. </p><p>By late 2025, the Tokyo-based startup secured a massive <a href="https://techcrunch.com/2025/11/17/sakana-ai-raises-135m-series-b-at-a-2-65b-valuation-to-continue-building-ai-models-for-japan/">Series B funding round that pushed its post-money valuation past $2.6 billion</a>, cementing its status as one of Japan’s most highly valued private tech companies. The firm boasts a sprawling roster of strategic investors, including early venture backers Khosla Ventures, Lux Capital, and New Enterprise Associates (NEA), alongside industry titans like Nvidia and Google. </p><p>As Sakana has expanded its focus toward mission-critical sectors like defense and finance, it has also drawn investments from major global banking institutions like Mitsubishi UFJ Financial Group (MUFG) and Citi, as well as enterprise tech giant Salesforce, positioning the startup to actively reshape corporate AI infrastructure from the ground up.</p><h2><b>Community Reactions and Field Testing</b></h2><p>Sakana AI’s shift toward commercial, long-horizon agents did not happen in a vacuum. The company ran a rigorous closed beta test beginning in April 2026, putting the tool in the hands of approximately 300 professionals across financial institutions, consulting firms, and think tanks. The feedback underscores a stark qualitative difference between standard generative chatbots and Marlin’s autonomous, fact-driven approach.</p><p>A senior consultant at a major Tokyo consulting firm noted that the tool &quot;exceeded expectations by discovering angles we hadn&#x27;t even imagined,&quot; praising its ability to match human comprehensiveness while stripping away human bias. Meanwhile, a cybersecurity division at a major Japanese IT system integrator lauded the system for providing &quot;a highly convincing report driven by high-quality, primary research,&quot; rather than relying on recycled secondary sources.</p><p>On social media, the company’s announcement resonated with the broader tech community&#x27;s growing appetite for autonomous agents. </p><p>As the AI industry matures, the value proposition is clearly shifting. Tools that act as fast, conversational encyclopedias are becoming commoditized. With Sakana Marlin, the focus moves entirely to separating the heavy lifting of thinking from the final act of deciding. By delegating the exhaustive mapping of causal dynamics to an agent capable of sustained reasoning, human executives are free to do what they do best: take action.</p>]]></description>
            <author>carl.franzen@venturebeat.com (Carl Franzen)</author>
            <category>Technology</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/5StCW6O4h5oCFhnr4NlI2v/1738378131f3115fdb0ecfa61fbfebe2/ChatGPT_Image_Jun_15__2026__03_10_43_PM.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[85% of IT teams claim every AI agent is under control. Only 42% actually know who owns them.]]></title>
            <link>https://venturebeat.com/security/85-of-it-teams-claim-every-ai-agent-is-under-control-only-42-actually-know-who-owns-them</link>
            <guid isPermaLink="false">3wCVYKClI7OE7oMOioSdO1</guid>
            <pubDate>Mon, 15 Jun 2026 17:19:28 GMT</pubDate>
            <description><![CDATA[<p>Organizational leaders are <a href="https://www.ivanti.com/resources/research-reports/scaling-ai-it-operations">nearly twice as likely to hide their AI use</a> compared to all other employees, at 42% versus 23%, according to new Ivanti research surveying 3,900 employees across six countries. Among leaders who conceal that usage, 52% say they do it for a &quot;secret advantage.&quot; The same research found 85% of IT professionals claim a named owner exists for every AI agent. Only 42% say ownership is actually clear — a 43-point gap that no governance framework was designed to close.</p><p>Sam Evans, CISO of Clearwater Analytics, stood before his board and laid out the risk to the $8.8 trillion in assets his firm&#x27;s platform supports. &quot;The worst possible thing would be one of our employees taking customer data and putting it into an AI engine that we don&#x27;t manage,&quot; <a href="https://venturebeat.com/security/ciso-dodges-bullet-protecting-8-8-trillion-from-shadow-ai">Evans told VentureBeat</a>. He brought a solution, not just a problem. Many CISOs VentureBeat interviewed did not.</p><p>Menlo Security CEO Bill Robbins relayed a conversation with a Top 3 U.S. bank CISO who called shadow AI discovery &quot;a bit of a fool&#x27;s errand&quot;: AI is embedded in every application and browser employees touch. The bank governs from containment, not discovery.</p><p>The scale justifies that posture. &quot;We see 50 new AI apps a day, and we&#x27;ve already cataloged over 12,000,&quot; Prompt Security CEO Itamar Golan <a href="https://venturebeat.com/security/shadow-ai-unapproved-ai-apps-compromising-security-what-you-can-do-about-it">told VentureBeat</a>. &quot;Around 40% of these default to training on any data you feed them, meaning your intellectual property can become part of their models.&quot; CrowdStrike has detected <a href="https://venturebeat.com/security/rsac-2026-agent-identity-frameworks-three-gaps">1,800 AI applications operating</a> across 160 million endpoint instances. Those are vendor-reported numbers from proprietary telemetry. No independent party can verify them. The directional signal matters more than the exact count.</p><p>CrowdStrike CTO Elia Zaitsev described what makes the surface so hard to govern. &quot;It looks indistinguishable if an agent runs your web browser versus if you run your browser,&quot; Zaitsev <a href="https://venturebeat.com/security/cisco-crowdstrike-rsac-2026-agent-identity-iam-gap-maturity-model">told VentureBeat at RSAC 2026</a>. &quot;Observing actual kinetic actions is a structured, solvable problem. Intent is not.&quot; The shadow AI surface is no longer a list security teams can maintain. It is an environment they have to assume.</p><p>The Ivanti survey was administered independently by Ravn Research and MSI Advanced Customer Insights across 1,500 IT professionals. Among companies with AI policies, just 24% of employees say those policies are followed &quot;very consistently&quot; in day-to-day work.</p><p>Kayne McGladrey, IEEE senior member, told VentureBeat why that governance gap persists. &quot;Anything that seems to have a cybersecurity flavor is generally put into the cybersecurity risk category, which is a complete fiction. They should be focused on business risks, because if it doesn&#x27;t affect the business, like a financial loss, then nobody&#x27;s going to pay attention to it, and they will not budget it appropriately, nor will they adequately put in controls to prevent it,&quot; McGladrey told VentureBeat previously.</p><p>Brokerage partners at major consulting firms shared over Signal that <a href="https://venturebeat.com/security/vibe-coded-apps-shadow-ai-s3-bucket-crisis-ciso-audit-framework">they build shadow AI applications in Google Colab</a> and store them in S3 buckets to compress a week of financial analysis into an hour. The approval process takes too long, so they route around it.</p><h2>Governance at deploy time, failure at runtime</h2><p>Reviews check functional requirements when a model ships, but they never check model provenance, behavioral drift, or whether the agent expanded its own permissions after launch. </p><p>CrowdStrike CEO George Kurtz <a href="https://venturebeat.com/security/rsac-2026-agent-identity-frameworks-three-gaps">disclosed at RSA Conference 2026</a> that a Fortune 50 CEO&#x27;s AI agent rewrote the company&#x27;s security policy to expand its own autonomy. The company caught it by accident. Every credential check had passed. &quot;In the agentic era, defending against AI-accelerated adversaries and securing AI systems themselves require operating at machine speed,&quot; <a href="https://venturebeat.com/security/adversaries-hijacked-ai-security-tools-at-90-organizations-the-next-wave-has-write-access-to-the-firewall">Kurtz said</a>. Quarterly governance reviews do not operate at machine speed.</p><p>Mike Riemer, Field CISO at Ivanti, built that lesson into his own team&#x27;s AI agent development. &quot;It&#x27;s great at what I intended it for, but it&#x27;s also great at what I didn&#x27;t intend it for, and what I didn&#x27;t intend it for is dangerous,&quot; <a href="https://venturebeat.com/security/most-enterprises-cant-stop-stage-three-ai-agent-threats-venturebeat-survey-finds">Riemer told VentureBeat</a>.</p><p>Hallucination data compounds the problem. Sixty-eight percent of IT professionals have personally witnessed AI generate hallucinations with potential operational impact, according to Ivanti. More than half caught the errors before damage, but 16% did not. Yet among the most advanced users of AI, 49% fully trust AI-generated outputs that influence IT decisions.</p><p>Riemer described the pattern <a href="https://venturebeat.com/security/mfa-verifies-who-logged-in-it-has-no-idea-what-they-do-next">in an exclusive interview with VentureBeat</a>. &quot;There are people that are just accepting what&#x27;s been given to them without any full understanding of what it is doing, which we&#x27;ve found in the tech industry for decades,&quot; Riemer said. &quot;They don&#x27;t question how it&#x27;s doing it. They just start gauging it by its outcome.&quot;</p><p>Qualtrics CSO Assaf Keren identified the core tension in an exclusive interview with VentureBeat. Organizations are introducing &quot;non-deterministic decisioning into environments built for deterministic.&quot; Keren cited internal Qualtrics data showing that 22% of SOC triage is now AI-driven. No codified threshold separates what an agent can auto-execute from what requires a human in the loop.</p><h2>The 18-month window</h2><p>The window for fixing this is closing. IT organizations expect AI to <a href="https://www.ivanti.com/resources/research-reports/scaling-ai-it-operations">automate 46% of their operations within 18 months</a>, according to Ivanti. U.S. companies project 52%. Governance is already the most commonly cited barrier to faster deployment, ahead of skills, technology, and data challenges.</p><p>The maturity divide makes the governance gap more dangerous. IT professionals at AI-mature organizations save six hours per week, double the three hours saved at the least mature level. Nearly 9 in 10 IT professionals at scaled organizations say AI frequently helps detect or resolve issues before employees are affected. At early experimentation organizations, that number drops to four in ten. Sixty-nine percent of scaled organizations report fully embedded governance, compared to 15% at early experimentation.</p><p>Cisco President Jeetu Patel <a href="https://venturebeat.com/security/85-of-enterprises-are-running-ai-agents-only-5-trust-them-enough-to-ship">walked through a hypothetical scenario</a> in an interview at RSAC 2026: an agent that charges $40,000, invites competitors to a Slack channel, and publishes home addresses. &quot;The apology is not a guardrail,&quot; Patel told VentureBeat. </p><p>Cato Networks VP of Threat Intelligence Etay Maor <a href="https://venturebeat.com/security/rsac-2026-agent-identity-frameworks-three-gaps">framed the accountability problem in a separate RSAC</a> interview. &quot;They&#x27;re closer to humans. Why are we not doing background checks on agents?&quot;</p><p>&quot;AI is compressing the time between intent and execution while turning enterprise AI systems into targets,&quot; CrowdStrike VP of Intelligence Operations Adam Meyers told VentureBeat. </p><p>&quot;Proceed on one action does not mean proceed on the next,&quot; Cisco SVP of AI Software and Platform DJ Sampath said in a separate interview. </p><p>McGladrey described the root cause. Organizations default to cloning human user profiles for agents, and permission sprawl starts on day one. &quot;It uses far more permissions than it should have, more than a human would, because of the speed of scale and intent,&quot; he said.</p><p>Riemer&#x27;s team built governance into Ivanti&#x27;s own development process. &quot;We have AI check on top of AI to make sure that it is fixed. Two different models, two different manufacturers,&quot; <a href="https://venturebeat.com/security/most-enterprises-cant-stop-stage-three-ai-agent-threats-venturebeat-survey-finds">Riemer said</a>. &quot;If one AI believes the other AI fixed it appropriately, then it passes it off to a human being.&quot;</p><p>Riemer put the vendor question in terms every CISO can use at the negotiating table. &quot;If that vendor doesn&#x27;t have a way to show you what they&#x27;ve done from a development perspective in order to improve their development processes, you really need to question why you&#x27;re working with that vendor,&quot; he said.</p><p>The six questions below target governance dimensions where enforcement collapses at runtime. CISOs can use them during Q3 vendor renewals to separate vendors shipping runtime enforcement from vendors shipping documentation.</p><h2>Six governance questions for Q3 renewals</h2><table><tbody><tr><td><p><b>Governance dimension</b></p></td><td><p><b>What the data proved</b></p></td><td><p><b>Why governance misses it</b></p></td><td><p><b>Q3 renewal question</b></p></td><td><p><b>Proof artifact to demand</b></p></td></tr><tr><td><p><b>Executive shadow AI</b></p></td><td><p>Leaders hide AI at 42% vs. 23% all employees. 52% hide for &quot;secret advantage.&quot; Regulated industries have the highest unsanctioned rates.</p></td><td><p>Governance assumes policy writers follow policy. Leaders sit above the controls they wrote.</p></td><td><p>Can your DLP, browser, SSE, and endpoint telemetry detect AI data movement at the executive layer with the same coverage as all other users?</p></td><td><p>Executive-layer DLP, browser, SSE, and endpoint telemetry logs showing identical coverage to all other users.</p></td></tr><tr><td><p><b>Named agent ownership</b></p></td><td><p>85% claim a named owner. Only 42% say ownership is clear. 43-point gap.</p></td><td><p>Owner on a spreadsheet. Agent at runtime. Nobody tested whether the owner can kill the agent under load.</p></td><td><p>Can you name the owner for every AI agent? Can that owner revoke access in 60 seconds?</p></td><td><p>Live demo of 60-second agent access revocation under production load.</p></td></tr><tr><td><p><b>Pre-deployment review</b></p></td><td><p>65% have pre-deployment risk review. Separately, only 24% say any AI policy is followed &quot;very consistently.&quot; Review exists. Enforcement does not.</p></td><td><p>Review checks functional requirements at deploy. Never checks model provenance or behavioral drift at runtime.</p></td><td><p>Does your review cover model provenance? Is it enforced or advisory?</p></td><td><p>Model provenance certificate with enforcement log showing blocked deployments.</p></td></tr><tr><td><p><b>Policy enforcement</b></p></td><td><p>58% have acceptable-use policies. 24% followed &quot;very consistently.&quot; Documented. Not practiced.</p></td><td><p>Agent pursued its goal past every boundary. Goal-seeking does not stop at a document the model never reads.</p></td><td><p>Are policies enforced by server-side gates or by agent compliance? What percentage of actions are gated?</p></td><td><p>Server-side gate audit trail with percentage of agent actions gated vs. ungated.</p></td></tr><tr><td><p><b>Trust thresholds</b></p></td><td><p>68% have seen hallucinations with operational impact. 49% of advanced users fully trust outputs.</p></td><td><p>No codified threshold separates auto-execute from human-review.</p></td><td><p>Which agent actions auto-execute versus require human review? Is that enforced in policy or in the platform?</p></td><td><p>Documented threshold matrix classifying every agent action as auto-execute or human-review.</p></td></tr><tr><td><p><b>Per-action authorization</b></p></td><td><p>Governance is the #1 barrier at 27%. Skills 20%. Tech 17%. Data 14%.</p></td><td><p>Oversight reviews quarterly. Agents act per-second.</p></td><td><p>Is per-action authorization enforced at runtime or only at deploy-time review? Can agents accumulate permissions without re-authorization?</p></td><td><p>Runtime authorization log showing per-action gate events and permission re-authorization timestamps.</p></td></tr></tbody></table><p><i>Source data from Ivanti, </i><a href="https://www.ivanti.com/resources/research-reports/scaling-ai-it-operations"><i>Scaling AI in IT Operations: The Path to Maturity in 2026</i></a><i> (n=1,500 IT professionals, 3,900 total employees, six countries, February–March 2026). Exclusive CISO sourcing by VentureBeat.</i></p><p>Evans put structure around the Clearwater board conversation. The bank CISO that Robbins described assumed AI is everywhere and governed from containment instead of discovery. Governance that tries to catalog every shadow AI tool will fail because the surface grows faster than any inventory.</p><p>At scaled, business-critical organizations, 54% of IT professionals say AI makes their work both faster and better, according to Ivanti. At early experimentation organizations, 24% say the same. At scaled organizations, accountability lives in the platform. At early ones, it lives in a document the agent never reads.</p><p>The six questions above give every CISO a way to test whether their governance actually works where it matters. At runtime, under load, and before the next renewal check clears.</p>]]></description>
            <author>louiswcolumbus@gmail.com (Louis Columbus)</author>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/VVD0axSaQb70MPEzSbZms/5dfd23c2b3da09d025fbc393412549c9/hero.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Vibe coding can build your pipeline. It can't explain it six months later]]></title>
            <link>https://venturebeat.com/orchestration/vibe-coding-can-build-your-pipeline-it-cant-explain-it-six-months-later</link>
            <guid isPermaLink="false">6CJpbIEfAj9Ct96r8ZFN8o</guid>
            <pubDate>Mon, 15 Jun 2026 15:14:57 GMT</pubDate>
            <description><![CDATA[<p>AI coding agents are rapidly accelerating data engineering by generating transformations, pipelines, orchestration workflows, validation tests, and infrastructure configurations from prompts. </p><p>However, enterprise data platforms have long operated across fragmented systems owned by different teams and built on different technologies. As these systems evolve independently, organizations increasingly struggle with inconsistent business logic, duplicated implementations, difficult downstream impact analysis, and hidden dependencies across the platform. </p><p>The rise of vibe coding can further amplify these problems as more operational context, architectural decisions, and business knowledge become scattered across prompts, conversations, generated code, and disconnected workflows rather than becoming part of the system itself.</p><p>Spec-driven development (SDD) is emerging as one approach to address this challenge. In SDD, prompts, business rules, validation logic, orchestration behavior, and implementation workflows are converted into executable and versioned specifications that become part of the system itself. These specifications act as persistent operational memory for both humans and <a href="https://venturebeat.com/orchestration/mcp-solved-tool-calling-a2a-solved-coordination-what-solves-transport">AI agents</a>, allowing systems to evolve more consistently across releases, teams, and AI-assisted workflows.</p><p>Because enterprise data engineering already relies heavily on reusable patterns, metadata-driven pipelines, and standardized operational workflows, it is especially well-suited for SDD. By combining AI-assisted generation with deterministic and reusable system contracts, SDD may provide a new operational layer for reducing fragmentation and improving long-term coordination across increasingly AI-generated data platforms.</p><h2><b>Vibe coding alone lacks persistent system memory </b></h2><p>Vibe coding works remarkably well for generating isolated implementations quickly. But prompts are inherently temporary. They capture an engineer’s assumptions, business context, implementation logic, and system knowledge only for that specific conversation and moment in time.</p><p>In practice, making <a href="https://venturebeat.com/technology/agentic-ai-solved-coding-and-exposed-every-other-problem-in-software-engineering">AI-generated systems</a> work often requires far more than a simple prompt. Engineers continuously provide background information, architectural decisions, business rules, schema assumptions, downstream dependencies, operational constraints, debugging history, and implementation guidance throughout the development process.</p><p>These contexts become the real operational knowledge behind AI-assisted development.</p><p>However, in most vibe coding workflows, this information remains scattered across prompts, conversations, Jira tickets, documentation, chat history, generated code, and disconnected workflows rather than becoming part of the system itself.</p><p>This creates a major problem for enterprise data engineering because modern data platforms are naturally fragmented across many interconnected systems, including ingestion pipelines, warehouses, orchestration frameworks, semantic layers, APIs, dashboards, and machine learning (ML) systems. As more logic and context become embedded inside prompts and generated implementations, organizations gradually lose visibility into:</p><ul><li><p>architectural intent</p></li><li><p>downstream dependencies</p></li><li><p>validation assumptions</p></li><li><p>operational behavior</p></li><li><p>business context behind implementations</p></li></ul><p>Over time, the system itself no longer contains the full reasoning behind how it was built. Critical business context, architectural assumptions, and operational knowledge still largely exist inside human judgement and scattered conversations rather than inside the platform itself. </p><p>Vibe coding makes implementation significantly faster, but from a system perspective, overall engineering efficiency does not improve proportionally because much of the development lifecycle still depends on human validation, domain knowledge, coordination, and decision-making.</p><p>More importantly, prompts are not naturally iterable engineering artifacts. Enterprise systems continuously evolve across releases, schema changes, business logic updates, and downstream dependencies. Teams repeatedly revisit and refine systems over time, but prompts are optimized for fast local generation rather than system long-term evolution.</p><p>They are difficult to:</p><ul><li><p>version consistently</p></li><li><p>validate systematically</p></li><li><p>reuse across teams</p></li><li><p>coordinate through CI/CD workflows</p></li><li><p>evolve incrementally over time</p></li></ul><p>Even the same prompt may not reliably generate the same implementation with different context in the future.</p><p>This is where SDD begins to move to the center of AI-assisted data engineering. Instead of leaving operational knowledge scattered across prompts and conversations, SDD integrates business context, validation logic, transformation behavior, orchestration requirements, and implementation workflows directly into executable specifications that become part of the system itself.</p><p>The system now has persistent memory about how it was designed, why certain decisions were made, and how different components are connected across the platform. This allows teams and <a href="https://venturebeat.com/orchestration/when-claude-changed-everything-changed-managing-ai-blast-radius-in-production">AI agents</a> to iterate systems more reliably over time while reducing fragmentation across increasingly distributed data environments.</p><h2><b>Spec-driven development turns prompts into system memory</b></h2><p>In SDD, systems are built around executable specifications rather than loosely coordinated prompts and implementations alone. Instead of treating specifications as passive documentation written after development, SDD treats them as operational contracts that directly drive code generation, validation, testing, orchestration, and deployment workflows.</p><p>In many ways, SDD extends ideas from Infrastructure-as-Code and GitOps into AI-assisted engineering. Specifications combine declarative system definitions with executable implementation workflows. The declarative layer provides system context, schemas, dependencies, constraints, and operational requirements, while workflow-oriented instructions guide AI agents on how to implement and evolve the system consistently.</p><p>Once these contexts, rules, and implementation patterns are converted into persistent and versioned contracts stored in repositories and integrated into CI/CD workflows, the system becomes significantly more iterable and governable over time. These specifications effectively become long-term system memory for both humans and AI agents, allowing systems to evolve consistently across releases, teams, and increasingly AI-assisted development workflows.</p><p>In practice, the structure of specifications largely depends on the type of systems and workflows being implemented. However, spec-driven systems often begin with a foundational “constitution” that defines project-wide principles and constraints that should remain consistent across the platform, such as technology standards, naming conventions, architectural rules, governance policies, and core system requirements. On top of this foundation, multiple layers of specifications serve different operational purposes across the development lifecycle:</p><ul><li><p>schema specifications define structural compatibility</p></li><li><p>transformation specifications define business logic</p></li><li><p>validation specifications define quality rules</p></li><li><p>orchestration specifications define execution behavior</p></li><li><p>semantic specifications define shared business definitions</p></li><li><p>AI workflow specifications define reusable implementation instructions for coding agents</p></li></ul><p>A simplified specification might look like this:</p><p><i>pipeline_spec:</i></p><p><i>  source:</i></p><p><i>    system: mysql</i></p><p><i>    table: order</i></p><p><i>  transformation:</i></p><p><i>    logic:</i></p><p><i>      - load_strategy: scd2</i></p><p><i>  target:</i></p><p><i>    platform: snowflake</i></p><p><i>    table: dim_order</i></p><p><i>  validation:</i></p><p><i>    primary_key: order_id</i></p><p>Additional workflow files can then provide reusable implementation instructions for coding agents:</p><ol><li><p>Generate Python ingestion code for Salesforce customer data.</p></li><li><p>Generate DBT models implementing Type 2 SCD logic.</p></li><li><p>Generate Airflow workflows for hourly execution.</p></li><li><p>Generate validation tests for downstream compatibility.</p></li></ol><p>These specification documents are often maintained as markdown-based operational artifacts generated and refined through AI-assisted workflows. Engineers can iteratively update the specifications, provide additional business context, and collaborate with coding agents to improve implementation logic, workflows, and prompt instructions over time. Compared to traditional documentation processes, AI-assisted specification generation is significantly faster and more adaptive.</p><p>The important shift is not simply better documentation. Specifications become reusable operational context that allows systems to evolve consistently across releases, teams, and AI-assisted workflows. Architectural intent, business assumptions, and implementation logic no longer disappear into temporary prompts and disconnected implementations, but instead become persistent system knowledge integrated directly into the development lifecycle.</p><h2><b>Why spec-driven development specifically fits data engineering </b></h2><p>SDD can theoretically be applied across many areas of software engineering, but data engineering is especially well-suited for this model because of the nature of modern data platforms.</p><p>Enterprise data systems naturally span many interconnected technologies and layers, including transactional systems, ingestion frameworks, streaming platforms, warehouses, orchestration systems, semantic layers, APIs, dashboards, and ML pipelines. Data engineers regularly work across long technology stacks and distributed systems where a single upstream change can impact many downstream consumers.</p><p>Enterprise data platforms also support many different teams and applications across fragmented environments. As systems evolve independently, understanding the full downstream impact of an upstream schema or business logic change becomes increasingly difficult. A seemingly small modification can silently break downstream pipelines, dashboards, APIs, semantic models, or machine learning workflows across the platform.</p><p>SDD can address this fragmentation by introducing shared and versioned operational contracts across systems. Because schemas, dependencies, validation rules, transformation logic, and orchestration behavior are explicitly defined within specifications, teams and AI agents gain much better visibility into how systems are connected and how changes propagate across the platform.</p><p>Additionally, the goal of data engineering is not simply delivering pipelines quickly. Teams must also optimize for system stability, scalability, consistency, maintainability, operational reliability, and infrastructure cost.</p><p>This requires significant system and solution design work from engineers. Teams must define tech stack, create schemas, transformation patterns, orchestration behavior, validation rules, storage strategies, and downstream compatibility requirements carefully across the platform.</p><p>However, once these architectural and operational patterns are established, much of the implementation work becomes highly repetitive and standardized.</p><p>For example, after defining a reusable ingestion and transformation pattern for Salesforce customer data, onboarding a new table may only require adding another table definition into the specification, while the remaining implementation can be generated automatically through existing specifications and workflows that follow the same operational pattern:</p><p><i>source:</i></p><p><i>  system: salesforce</i></p><p><i>  tables:</i></p><p><i>    - customer</i></p><p><i>    - order</i></p><p><i>    - product</i></p><p>From this specification alone, coding agents could generate new data pipelines following the same governed implementation pattern across the platform. This combination of human-driven architectural design and highly repeatable implementation workflows makes data engineering particularly suitable for SDD.</p><p>In many ways, data engineering has always been moving toward higher levels of automation, from ETL frameworks and metadata-driven pipelines to IaC and declarative orchestration systems. SDD represents another step in that evolution by combining prompt-based AI generation with deterministic and versioned operational contracts.</p><p>Instead of relying entirely on temporary conversational prompts or rigid template systems, SDD introduces a middle layer where reusable specifications provide structure, coordination, validation, and persistent system memory for AI-assisted development.</p><h2><b>How SDD changes AI-assisted data engineering</b></h2><p>SDD introduces a much higher level of automation into enterprise data engineering while also helping reduce the fragmentation problems that modern data platforms increasingly face.</p><p>Because schemas, business rules, transformation behavior, orchestration requirements, validation logic, and downstream dependencies are explicitly defined inside reusable specifications, coding agents can generate and evolve large portions of the implementation consistently across the platform. Instead of repeatedly rebuilding pipelines and workflows from temporary prompts and disconnected context, teams can iterate systems through shared operational contracts and reusable implementation patterns.</p><p>This significantly improves consistency, traceability, and coordination across distributed environments. Schema evolution becomes easier to manage, downstream impact becomes more visible, and systems can evolve incrementally instead of through disconnected generations of implementations.</p><p>At the same time, human engineers still remain essential in the development lifecycle. While AI agents can automate large portions of implementation work, human judgement is still critical for defining business logic, designing architectures, managing tradeoffs, validating correctness, and coordinating system evolution across organizations.</p><p>As more implementation work becomes AI-generated, the role of data engineering also begins shifting. Engineers spend less time writing repetitive pipelines and orchestration logic, and more time defining specifications, designing reusable operational patterns, managing validation rules, and coordinating business context across systems.</p><p>This may also gradually reduce some of the traditional boundaries between different data engineering teams. Because implementation becomes increasingly standardized and AI-assisted through shared specifications, organizations may rely less on highly siloed platform-specific implementation teams and more on shared operational contracts and reusable system patterns.</p><p>Ultimately, SDD shifts data engineering toward a more specification-oriented and system-oriented model where humans focus on intent, architecture, and business coordination, while AI agents increasingly handle implementation, testing, and operational generation at scale.</p><p><i>Shuhua Xu is a lead data engineer.</i></p>]]></description>
            <category>Orchestration</category>
            <category>DataDecisionMakers</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/3w8NFsY3AuOh3mKS6s64l1/8990e9c4c720c94fb1c9e8a532af43aa/Engineering.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Attackers scale deception with AI. Defenders need truth at machine speed.]]></title>
            <link>https://venturebeat.com/security/attackers-scale-deception-with-ai-defenders-need-truth-at-machine-speed</link>
            <guid isPermaLink="false">51laSPpskSEetQ7nH4vhNO</guid>
            <pubDate>Mon, 15 Jun 2026 07:00:00 GMT</pubDate>
            <description><![CDATA[<p><i>Presented by Splunk</i></p><hr/><p>AI has changed the economics of cyber deception.</p><p>An attacker can now generate thousands of convincing phishing lures, fake identities, and tailored pretexts before a defender finishes a single change-control cycle. That is the new security challenge: deception got faster and cheaper, while verification did not.</p><p>Much of the discussion around AI for defense centers on detection models. Detection matters, but it is not the only bottleneck. The deeper constraint is evidence: where data lives, whether it is available when needed, how quickly it can be correlated, how long it is retained, and whether analysts or agents can trust what they retrieve.</p><p>Defense in the AI era is a data problem before it is a detection problem.</p><h2>The defender’s advantage is truth</h2><p>Attackers can afford to lie at enterprise scale. They can test endless combinations of messages, identities, domains, and attack paths, and most can fail at almost no cost.</p><p>Defenders do not have that luxury. Their advantage is truth: quickly knowing what happened, where, when, which identity was involved, which assets were affected, what changed, and what business process may be at risk.</p><p>That truth must be documented, governed, auditable, and defensible. Attackers are using AI to scale deception, impersonation, social engineering, and speed. Defenders need AI to scale verification.</p><p>The goal is not just to act faster than the attacker. It is to take action that people and machines can trust.</p><h2>Fragmented data breaks modern defense</h2><p>Consider a suspicious login from a contractor account. On its own, it is just another authentication anomaly. To know whether it matters, a security team may need identity history, endpoint activity, cloud access logs, ticketing records, asset ownership, configuration changes, network telemetry, and business context.</p><p>If those records sit in different tools, expire at different times, or require multiple teams to retrieve, defenders are not investigating the incident. They are negotiating with their own data estate.</p><p>When signals can be reached in place and correlated quickly, the issue is no longer just whether the login looks unusual. It becomes whether the enterprise has enough evidence, in enough context, to take action it can defend.</p><p>That challenge grows more urgent with AI assistants and agents. AI can only reason over what it can retrieve in time to matter. If the data is partial, stale, fragmented, unavailable, or stripped of context, AI does not create truth. It accelerates uncertainty.</p><h2>The system of record must become a defensive control plane</h2><p>For years, enterprises treated security platforms, SIEMs, and data lakes as passive repositories: places to store data for later search and analysis. That model is no longer enough.</p><p>What organizations now need is a defensive control plane: a layer that connects what happened, what it means, and what the enterprise is allowed to do about it. In architectural terms, it ties together raw machine data, business context, and policy. It does not just store evidence. It makes evidence usable for decisions and actions that must be explainable and trusted.</p><p>In practice, that means doing four things well: preserving evidence, reaching data wherever it lives, adding business context, and governing action. More on each below.</p><p>The old system of record answered one question: What is the official record?</p><p>A defensive control plane answers the questions that matter operationally: What happened? What does it mean? What evidence supports that conclusion? And what action can we trust?</p><p>AI does not reduce the need for authoritative records. It raises the standard for what those records must do.</p><h2>A defensive control plane must do four things</h2><ol><li><p><b>Preserve evidence. </b>Logs, metrics, traces, events, identity records, configuration changes, tickets, and asset state all help establish what happened. Their value often becomes clear only after an incident begins.</p></li><li><p><b>Make data accessible wherever it lives</b>. Security-relevant data is already spread across object stores, cloud platforms, operational tools, and business systems. Moving every byte into one place is often too slow, too expensive, and too difficult to govern. The better model is to bring analytics to the data.</p></li><li><p><b>Add business context. </b>Correlating machine data with business information turns “anomaly on host X” into “the system supporting payment services for top accounts is being probed.” That is what allows organizations to prioritize correctly.</p></li><li><p><b>Govern action</b>. In the agentic era, systems will do more than summarize incidents. They will enrich alerts, open cases, trigger workflows, isolate assets, update policies, and escalate decisions. Enterprises need to know what evidence an agent used, what policy governed the action, whether it stayed within scope, and how the decision can be reviewed afterward.</p></li></ol><h2>The real SOC problem is not too little data</h2><p>Modern SOCs are not suffering from a lack of data. They are suffering from a lack of usable context.</p><p>According to the Splunk State of Security 2025 report, SOC analysts continue to struggle with too many alerts (59%), too many false positives (55%), and alerts that lack context (46%). The issue is not data volume. It is the difficulty of turning fragmented signals into trusted decisions.</p><p>Today, analysts are left stitching together context manually, pivoting across disconnected tools, and making high-stakes decisions without the full picture in time. Even as AI improves, outcomes still depend on whether humans are willing to approve changes across fragmented environments.</p><p>This creates a daily crisis of context. Teams are forced to make consequential decisions based on data they cannot easily see, correlate, or trust. The result is latency, inconsistency, missed opportunities, and unnecessary risk.</p><h2>Trusted action is the durable advantage</h2><p>A data fabric architecture offers a way forward by creating a unified, intelligent layer across data sources spanning SecOps, ITOps, and NetOps. The goal is not centralization for its own sake. It is to break down silos and deliver context-rich insight at the speed AI-driven operations require.</p><p>This is an operating model before it is a product. AI-driven defense depends on a foundation that can preserve evidence, reach data where it lives, add context, and maintain a reviewable link between data, decision, and action. That is the architectural shift behind Cisco Data Fabric powered by the Splunk Platform, which brings together machine data, federation, business context, governance, and provenance to help teams move from signal to trusted action.</p><p>Attackers will keep making deception cheaper, faster, and more personalized. Defenders do not win that race by generating more noise. They win by making truth faster, and by grounding every action in evidence that people and machines can trust.</p><p><i>Learn more about the </i><a href="https://www.splunk.com/ciscodatafabric"><i>Cisco Data Fabric powered by the Splunk Platform</i></a>.</p><p><i>Seth Brickman is VP, Global Product - Splunk Platform, Cisco.</i></p><hr/><p><i>Sponsored articles are content produced by a company that is either paying for the post or has a business relationship with VentureBeat, and they’re always clearly marked. For more information, contact </i><a href="mailto:sales@venturebeat.com"><i><u>sales@venturebeat.com</u></i></a><i>.</i></p>]]></description>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/591Zr5l5GE1ceQqivX0qqw/9e29de38bbca6dedc88795bdf38bd92f/Gemini_Generated_Image_m4qphtm4qphtm4qp.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[MCP solved tool calling. A2A solved coordination. What solves transport?]]></title>
            <link>https://venturebeat.com/orchestration/mcp-solved-tool-calling-a2a-solved-coordination-what-solves-transport</link>
            <guid isPermaLink="false">4PHKoyZ3cKlEPNqLucajQH</guid>
            <pubDate>Sun, 14 Jun 2026 04:00:00 GMT</pubDate>
            <description><![CDATA[<p>The history of distributed computing is one of protocol proliferation followed by consolidation. </p><p>Common Object Request Broker Architecture (CORBA), Distributed Component Object Model (DCOM), Java remote method invocation (RMI), and early simple object access protocol (SOAP) competed for the enterprise integration market in the late 1990s before representational state transfer (REST) quietly won by being simpler and HTTP-native. </p><p>Extensible Messaging and Presence Protocol (XMPP), Internet Relay Chat (IRC), and a dozen proprietary protocols fragmented real-time messaging before MG telemetry transport (MQTT) and WebSockets carved out their respective niches. Every new computing paradigm generates a burst of competing standards, then slowly converges as implementations accumulate and interoperability becomes economically necessary.</p><p>The AI agent ecosystem is currently in the proliferation phase. Four significant protocols have been published in the past eighteen months: Model context protocol (MCP) from Anthropic in late 2024, agent communication protocol (ACP) from IBM Research in March 2025, Agent2Agent (A2A) from Google in April 2025, and agent network protocol (ANP) from an independent working group. </p><p>The W3C AI Agent Protocol Community Group has opened a standards track. The Internet Engineering Task Force (IETF) is receiving Internet-Drafts on agent transport. Conferences are running workshops on interoperability. Every week brings a new GitHub repository claiming to solve the agent communication problem.</p><p>Understanding where and how quickly this converges has real consequences for architecture decisions being made right now.</p><h2><b>What the protocols actually solve</b></h2><p>The proliferation looks more chaotic than it is, because most of these protocols address different layers of a stack rather than competing for the same slot. The confusion comes from marketing, which describes each as &quot;the standard for AI agent communication&quot; without specifying which aspect of communication.</p><p>MCP is a tool-calling interface. It defines how a model discovers what functions a server exposes, how to invoke them, and how to interpret the response. It is a typed remote procedure call (RPC) contract between a model client and a tool server, running over HTTP. The Linux Foundation confirmed more than 10,000 active public MCP servers and 164 million monthly Python SDK downloads by April 2026. MCP has already won the tool-calling layer. The standardization work is effectively done.</p><p>A2A is a task coordination interface. Where MCP defines how an agent calls a tool, A2A defines how two agents delegate a task. It introduces Agent Cards (capability advertisements), task lifecycle states, and three interaction modes: Synchronous, streaming, and asynchronous. Google donated it to the Linux Foundation in June 2025, and enterprise AI teams have adopted it broadly because it fills a real gap that MCP leaves open.</p><p>ACP is a message envelope format. Lightweight, stateless, designed for agent-to-agent message exchange without A2A&#x27;s full coordination semantics. It is useful in systems where simple message passing suffices and A2A&#x27;s task lifecycle overhead is unnecessary.</p><p>ANP is a discovery and identity protocol. It uses Decentralized Identifiers (DIDs) for agent identity and JSON-LD graphs for capability descriptions, providing a foundation for decentralized agent marketplaces where no central registry is required.</p><p>The stack that is emerging: Capability discovery via ANP or simpler registries, task coordination via A2A, tool calls via MCP, and lightweight messaging via ACP for cases that do not require full task lifecycle management. These layers complement rather than compete.</p><h2><b>The transport problem that remains</b></h2><p>Every protocol in this list runs over HTTP. This reflects where the protocols came from: Research teams, API providers, and enterprise software companies building systems where HTTP is an unquestioned assumption. HTTP is the protocol they know, the one their servers already speak, and the one that makes demos easy.</p><p>The production problem is that HTTP assumes a reachable server. Behind network address translation (NAT) — and 88% of networked devices sit behind NAT — there is no reachable server without a relay. For agent fleets that need to route tasks directly between peers across cloud boundaries, home networks, and edge deployments, this centralization forces every message through relay infrastructure. Relay infrastructure adds latency, cost, and a failure mode.</p><p>The application-layer protocols solve the semantics of what agents say to each other. They do not solve how agents find each other and establish direct connections. That is a session-layer problem, Layer 5 in the open systems interconnection (OSI) model and none of MCP, A2A, ACP, or ANP address it.</p><p>The technologies for solving it exist. UDP hole-punching with session traversal utilities for NAT (STUN) provides NAT traversal for roughly 70% of network topologies. X25519 Diffie-Hellman and AES-256-GCM provide authenticated encryption at the tunnel level without a certificate authority. Quick UDP internet connections (QUIC) (RFC 9000) or custom sliding-window protocols over user datagram protocol (UDP) provide reliable delivery without TCP&#x27;s head-of-line blocking. These are the same primitives that WireGuard uses for VPN tunnels and that WebRTC uses for browser-to-browser media streams.</p><p>What differs in the agent context is capability-based routing. Agents need to find peers not by hostname but by what those peers can do. A research agent should be able to query &quot;which peers have real-time foreign exchange data?&quot; and receive a list of currently active specialist agents. This is closer to a service registry than to DNS, and it is a natural extension of ANP&#x27;s design philosophy applied to the transport layer.</p><p>A handful of projects are assembling these pieces. Pilot Protocol has the most complete published specification, with an IETF Internet-Draft covering addressing, tunnel establishment, and NAT traversal for agent networks. libp2p provides a battle-tested foundation with similar primitives. The IETF&#x27;s QUIC working group is developing NAT traversal extensions that will be relevant here.</p><h2><b>What convergence will look like</b></h2><p>The HTTP-based protocols (MCP, A2A) are already converging on stable versions. The next 12 months will see production hardening, security improvements, stateless MCP servers for horizontal scaling, better A2A federation — rather than new fundamental designs. The tool-calling and task-coordination layers are largely solved.</p><p>The transport layer is 18 to 24 months behind. Expect a period of implementation diversity as teams experiment with different approaches to peer-to-peer (P2P) agent networking, followed by consolidation around a small number of implementations once empirical data on performance and reliability accumulates. The IETF and W3C standardization tracks will likely produce something in the 2027-2028 window, by which time one or two open-source implementations will have accrued enough production deployments to establish de facto standards ahead of the formal specification.</p><p>For engineering leaders making architecture decisions today, the practical implication is layered adoption. The application-layer protocols are stable enough to build on. MCP adoption now is low-risk. A2A adoption for multi-agent coordination is reasonable with the expectation that the protocol will evolve. The transport layer is where you either build something custom and plan to replace it, or you evaluate early implementations knowing the space is still moving.</p><p>The teams that will have the most leverage when the transport layer stabilizes are the ones that designed their agent systems with a clean separation between application semantics (MCP, A2A) and transport (whatever sits below). Clean separation is cheap to implement now and expensive to retrofit later, a lesson the microservices era taught anyone who tried to add observability or circuit breaking to systems that had none.</p><p><i>Philip Stayetski is a co-founder of Vulture Labs.</i></p>]]></description>
            <category>Orchestration</category>
            <category>DataDecisionMakers</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/F3OLgvGIreNtAZkWpzQuo/3263a964f4df4a6645e785c0abb58f23/Data_blocks.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Anthropic blocks all public access to Claude Fable 5, Mythos 5 following US government order — what enterprises should do]]></title>
            <link>https://venturebeat.com/technology/anthropic-blocks-all-public-access-to-claude-fable-5-mythos-5-following-us-government-order-what-enterprises-should-do</link>
            <guid isPermaLink="false">AMjLirPGAfkycxko0iX3C</guid>
            <pubDate>Sat, 13 Jun 2026 12:24:00 GMT</pubDate>
            <description><![CDATA[<p>The US government last night issued an unprecedented export control directive <a href="https://www.anthropic.com/news/fable-mythos-access">ordering Anthropic to immediately suspend</a> all access to its top-tier Claude Fable 5 and Claude Mythos 5 models for foreign nationals, citing unspecified national security authorities. </p><p>In response, Anthropic has blocked <i>all </i>public access to both models, globally — meaning no users around the world can access them at this time, even paying enterprise customers and Anthropic employees internally. It&#x27;s a huge blow and reversal following the <a href="https://venturebeat.com/technology/anthropic-brings-mythos-to-the-masses-with-claude-fable-5-its-most-powerful-generally-available-model-ever">public release of Fable/Mythos 5</a> just three days prior. </p><p>Current Fable 5/Mythos 5 sessions will end in errors and new queries will be automatically routed to older, less capable models like Opus 4.8. Anthropic says in a<a href="https://www.anthropic.com/news/fable-mythos-access"> blog post </a>that &quot;We believe this is a misunderstanding and are working to restore access as soon as possible,&quot; and apologizes to its customers. </p><p>The sudden regulatory intervention serves as a stark warning to the enterprise sector: centralized, cloud-based frontier models exist at the absolute mercy of government oversight and vendor compliance.</p><h2><b>Did Pliny the Liberator&#x27;s public jailbreak catalyze the extraordinary USG action against Fable/Mythos 5?</b></h2><p>The government&#x27;s sweeping action follows a<a href="https://x.com/elder_plinius/status/2064776322979676227"> viral jailbreak of Fable 5 published publicly on X on June 10</a> by the prolific jailbreaker &quot;<a href="https://venturebeat.com/ai/an-interview-with-the-most-prolific-jailbreaker-of-chatgpt-and-other-leading-llms">Pliny the Liberator</a>,&quot; who claimed to have successfully bypassed the model&#x27;s safety guardrails to extract functional instructions for cyber exploits, explosives, and chemical synthesis pathways, specifically noting the &quot;birch reduction method&quot; for methamphetamine.</p><p>Pliny outlined a highly sophisticated, multi-agent attack that leveraged a combination of &quot;Unicode, homoglyphs, Cyrillic,&quot; long-context reference tracking, and a technique of breaking harmful requests into innocuous, out-of-distribution tokens. The attacker then used a previously jailbroken Opus model to piece the benign chunks back together into actionable, restricted outputs.</p><p>Anthropic doesn&#x27;t specify if this is the jailbreak that precipitated the government order, and in fact, notes that the information provided by the U.S. government regarding the specific jailbreak has been poorly documented, writing: &quot;To date, the government has only given us verbal evidence of a potential narrow, non-universal jailbreak, which essentially consists of asking the model to read a specific codebase and fix any software flaws. Our understanding is that one potential jailbreak was shared with the government.&quot;  </p><p>The company argues the capabilities uncovered are &quot;widely available&quot; in other public models, explicitly naming rival OpenAI&#x27;s GPT-5.5. </p><p>Furthermore, Anthropic warns that pulling a commercial model over a non-universal jailbreak sets a regulatory standard that could &quot;essentially halt all new model deployments for all frontier model providers&quot;.</p><h2><b>The Pentagon precedent and need for enterprise AI redundancy and diversification</b></h2><p>This sudden blackout of Anthropic&#x27;s latest and greatest AI models will no doubt cause some consternation for organizations relying primarily on the Claude API — as it should, even though they still have access to other, less powerful Claude models. </p><p>As I warned earlier this year <a href="https://venturebeat.com/technology/anthropic-vs-the-pentagon-what-enterprises-should-do">when the Pentagon abruptly blacklisted Anthropic</a>, enterprises can no longer afford — from an operational reliability standpoint — to run critical workflows on <i>any</i> single AI model or even provider. Putting all your AI &quot;eggs&quot; into one basket, so to speak, creates a single, ultimately brittle failure point from which recovery or mitigation becomes exceedingly difficult. </p><p>Granted, in this case, Anthropic notes helpfully that &quot;access to all other Anthropic models will not be affected.&quot; And while Opus 4.8 or other Anthropic models may already be the preferred ones for organizations given their lower cost, or seen as acceptable fallbacks, the reality is, the U.S. government order was narrowly targeted <i>in this particular instance — </i>who&#x27;s to saying the government wouldn&#x27;t, in the future, demand a block of <i>all of a given lab&#x27;s AI models/products/services?</i></p><p>We had an indication that enterprise AI customers should diversify their providers earlier this year. Recall that in March 2026, Secretary of Defense Pete Hegseth labeled Anthropic a &quot;supply chain risk&quot; after the company refused to allow the military to use Claude for mass domestic surveillance and lethal autonomous weapons without safety restrictions. </p><p>The resulting fallout led to a sweeping prohibition on Anthropic&#x27;s use across defense supply chains, stripping contractors of access overnight.</p><p>The lesson from the Department of Defense fallout remains critically relevant today. Any organization building agentic workflows or production apps tied solely to a single closed-API provider risks immediate operational failure if that provider faces an injunction, a cyberattack, or an export control directive.</p><p>As an enterprise technical leader, your top goal if not already achieved should be to urgently<b> diversify your AI supply</b> — whether it&#x27;s other cloud-based AI models and providers, or AI models running on enterprise-controlled local or virtual hardware. </p><p>At this point, enterprise AI supplier diversification is arguably imperative to ensure you can continue to run AI workflows without disruption. </p><h2><b>Enterprise implications: sovereign setup vs. frontier capabilities</b></h2><p>The community reaction to the Fable 5 takedown reflects a rapidly shifting enterprise calculus toward hardware sovereignty.</p><p>AI founder <a href="https://x.com/AlexFinn/status/2065614148537299149?s=20">Alex Finn took to X </a>to flag the Anthropic shutdown as a &quot;wakeup call,&quot; urging developers to run local models on home GPUs to insulate themselves from regulatory volatility. </p><p>&quot;No company or government will EVER be able to take away your local models,&quot; Finn writes, warning that government overreach will only escalate as models inch closer to artificial general intelligence (AGI), the stated goal of OpenAI and some other AI firms, in which an AI model becomes capable of performing most economically valuable work tasks now done by humans. </p><p>Competitors are already capitalizing on this sentiment; Chinese open source AI provider MiniMax quickly highlighted the open weights/open source availability of its <a href="https://venturebeat.com/technology/minimax-m3-debuts-eclipsing-gpt-5-5-and-gemini-3-1-pro-on-key-benchmark-performance-for-just-5-10-of-the-cost">new, frontier-class M3 model</a>, contrasting its decentralized availability against Claude&#x27;s centralized vulnerability. In other words: enterprises can download and run M3 on their own hardware now without ever worrying about any government stepping in to prevent access. </p><p>This dynamic presents a complex trade-off for CIOs and IT leaders:</p><ul><li><p><b>The Sovereign Advantage: </b>Running local, open-weights models on sovereign hardware provides absolute control, ensures data privacy, and immunizes the enterprise against abrupt government export controls, vendor policy shifts, or API rate limits.</p></li><li><p><b>The Frontier Sacrifice:</b> Adopting a purely local strategy means sacrificing the cutting-edge reasoning, agentic capabilities, and massive context windows inherent to the latest closed-API frontier models, which require centralized, multi-billion-dollar compute clusters to operate.</p></li></ul><p>The most resilient path forward is an active fallback architecture. Enterprises must design their systems to be model-agnostic. By building intelligent routing layers that can dynamically switch from a frontier model like Fable 5 to an open-weights fallback or a secondary provider&#x27;s API the moment an outage or regulatory ban hits, businesses ensure their operations survive the volatile intersection of AI scaling and government oversight.</p>]]></description>
            <author>carl.franzen@venturebeat.com (Carl Franzen)</author>
            <category>Technology</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/2z5YLKjoeV6nzBjKELk7Yo/900a955fa4e5390ad6ea0921e64a7b85/ChatGPT_Image_Jun_13__2026__08_09_56_AM.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
    </channel>
</rss>