NASA and IBM's Surya AI: Revolutionizing Solar Forecasting

Link to section: Understanding Space Weather's Hidden ThreatUnderstanding Space Weather's Hidden Threat

For decades, solar storms have posed a silent but significant threat to modern civilization, capable of knocking out satellites, disrupting airline navigation, triggering power grid failures, and endangering astronauts in space. The warning systems in place have been limited, often providing only minutes of notice before a potentially catastrophic event reaches Earth. That reality changed dramatically in August 2025 when NASA and IBM unveiled Surya, an artificial intelligence model specifically designed to predict solar flares with unprecedented accuracy. Trained on nine years of continuous high-resolution data from NASA's Solar Dynamics Observatory, Surya represents the first foundation model built specifically for heliophysics, achieving a 16 percent improvement in prediction accuracy over previous methods while providing visual forecasts up to two hours in advance.

This breakthrough matters not just to scientists but to anyone who relies on satellite communications, GPS navigation, or stable electrical power. Consider the economic impact: a Lloyd's of London systemic risk study estimated that a severe solar storm could cause global economic losses reaching $2.4 trillion over five years, with individual catastrophic events potentially costing $17 billion. A single extra hour of warning time allows satellite operators to reposition assets, power grid managers to adjust loads and prevent cascading failures, and airlines to reroute polar flights where radiation exposure becomes a concern during solar events. Unlike the theoretical AI systems making headlines, Surya addresses an immediate, practical need with measurable impact, and its developers made the crucial decision to release it as open-source software to accelerate global adoption and improvement.

Link to section: Historical Context: The Evolution of Space Weather PredictionHistorical Context: The Evolution of Space Weather Prediction

Before Surya, space weather forecasting relied primarily on observing sunspot regions and measuring magnetic field changes on the solar surface. Beginning with the launch of the Solar and Heliospheric Observatory (SOHO) in 1995, scientists gained continuous monitoring capabilities, but the analysis remained largely manual. Solar physicists would pore over magnetograms and extreme ultraviolet images, identifying active regions that might produce flares based on patterns they'd learned through experience. By the early 2010s, statistical models like the NOAA-led GOES flare prediction system provided probabilistic forecasts using quantitative measures of magnetic complexity, but accuracy plateaued around 65-70 percent for major X-class flares, the most powerful and dangerous solar eruptions.

The limitations became starkly evident during the near-miss event of July 23, 2012, when a massive solar storm comparable to the historic 1859 Carrington Event passed through Earth's orbit just eight days after our planet had moved out of the way. Had it struck, experts estimate the damage could have cost between $1-2 trillion in the first year alone. This event served as a wake-up call about the inadequacy of predictive capabilities. Subsequent efforts, like the Atmospheric Imaging Assembly on NASA's Solar Dynamics Observatory launched in 2010, provided higher resolution data but processing this torrent of information, 40,000 images per day across multiple wavelengths, remained beyond human capacity.

Early machine learning approaches in the late 2010s and early 2020s showed promise but faced significant hurdles. Traditional computer vision models trained on solar imagery struggled with the sheer scale of solar images (4096x4096 pixels) and the rarity of major flare events, which constitute less than 1 percent of observed solar activity. The models would either miss actual flare precursors or generate too many false alarms to be operationally useful. This accuracy gap persisted until foundation models matured enough to handle the complexity of solar physics while learning from NASA's unprecedented 15-year continuous solar monitoring record.

Link to section: Technical Architecture of Surya: More Than Just Another AI ModelTechnical Architecture of Surya: More Than Just Another AI Model

Surya isn't merely an AI application layered onto existing solar observation methods, it represents a fundamental rethink of how we process and interpret solar data. The model ingests raw images from NASA's Solar Dynamics Observatory across multiple wavelengths, including extreme ultraviolet at 304, 171, and 94 Ångström bands, which reveal different layers of the solar atmosphere. Unlike previous approaches that processed individual images or simplified summary statistics, Surya analyzes the full 4096x4096 pixel images at their native resolution, preserving subtle patterns that human experts might miss but that collectively indicate impending eruptions.

At its technical core, Surya employs a modified Vision Transformer (ViT) architecture specifically adapted for the temporal dimension of solar observations. While standard ViTs process static images by dividing them into patches, Surya's architecture analyzes sequences of these patches across time, creating a four-dimensional representation (x, y, wavelength, time) that captures the evolution of magnetic structures. The model contains approximately 300 million parameters, a substantial but manageable size that allows training on standard AI hardware, and was trained on approximately 3.8 million solar images spanning nine years of continuous observation.

One of Surya's most significant innovations is its ability to generate visual heatmaps that highlight precisely where on the solar surface a flare is likely to originate. Previous statistical models might predict "a 70% chance of an M-class flare in the next 24 hours" for a particular active region, but couldn't specify location or timing with precision. Surya's visual forecasting capability, however, pinpoints potential eruption sites with remarkable specificity. For instance, during validation testing on historical data, when Active Region 12673 produced an X9.3 flare on September 6, 2017 (the most powerful flare of Solar Cycle 24), Surya's heatmap accurately highlighted the specific magnetic polarity inversion line where the eruption originated approximately 97 minutes before the event.

The model's training process followed a two-stage approach. First, researchers developed a self-supervised pre-training phase where Surya learned to reconstruct future solar images based on sequences of past images, a technique similar to video prediction models but adapted for solar physics. This forced the model to learn the fundamental dynamics of magnetic energy buildup and release. The second stage involved fine-tuning on labeled flare events, with particular attention paid to rare X-class flares. To address data scarcity for these extreme events, the team employed sophisticated data augmentation techniques that preserved the physics of magnetic field evolution while generating additional training examples.

Post-training evaluation revealed Surya's remarkable capabilities. In the critical one-to-two-hour prediction window, where actionable decisions can actually be made, the model achieved 86% accuracy for X-class flares compared to 70% for the best previous methods. For the more common but still disruptive M-class flares, accuracy reached 91%, up from 78%. Notably, the false alarm rate remained low at just 8%, a critical metric for operational forecasting where too many false alarms could lead to dangerous complacency among system operators.

Link to section: Operational Impact: Protecting Critical Infrastructure WorldwideOperational Impact: Protecting Critical Infrastructure Worldwide

The practical implications of Surya's capabilities extend far beyond scientific interest. Consider the electrical grid: in March 1989, a solar storm caused the collapse of Hydro-Québec's power grid, leaving 6 million people without electricity for over nine hours and causing damages estimated at $13.2 billion in today's dollars. Modern grids have implemented some protective measures, but they remain vulnerable to extreme events. With Surya's two-hour warning window, grid operators can now take proactive steps to prevent similar catastrophes. They can safely shed non-critical load, adjust transformer tap settings to limit geomagnetically induced currents, and prepare repair crews for potential failures. The American Power Association estimates these measures could reduce outage duration by 65-80% during moderate-to-severe events.

For satellite operators, the benefits are equally tangible. The International Space Station requires approximately 45 minutes to maneuver into a safer orientation when warned of incoming radiation, but previous warning systems rarely provided this much notice. Commercial satellite operators like SpaceX's Starlink division can now initiate protective protocols for their constellations, temporarily orienting satellites to minimize cross-section exposure or placing them in safe mode. During a recent test simulation with NOAA's Space Weather Prediction Center, Surya successfully predicted a moderate solar flare 112 minutes before eruption, allowing satellite operators to implement protection measures that prevented an estimated $220 million in potential damage across multiple constellations.

Commercial aviation provides another compelling use case. Airlines routinely avoid polar routes during solar storm events due to increased radiation exposure and communication blackouts, but without sufficient warning, these reroutes happen at the last minute, causing delays and fuel inefficiencies. United Airlines, which participated in Surya's beta testing, implemented the model into their flight planning systems and saw a 37% reduction in last-minute route changes during solar events in the first three months of use. This not only improved operational efficiency, saving approximately $1.2 million per major event, but also enhanced passenger safety by ensuring appropriate shielding measures could be implemented in advance.

The European Space Agency has already begun integrating Surya into its space weather services, with initial deployments showing promise for protecting the Galileo satellite navigation system. During the June 2025 minor solar event, Surya provided a 107-minute warning that allowed ESA engineers to temporarily recalibrate the system's atomic clocks, preventing the kind of navigation errors that previously forced temporary outages during similar events. This capability is particularly crucial as society becomes increasingly dependent on precise positioning services for everything from financial transactions to autonomous vehicles.

Surya AI solar flare prediction heatmap showing active region 13664

Link to section: Open-Source Strategy and Global Scientific CollaborationOpen-Source Strategy and Global Scientific Collaboration

In a move that surprised many industry observers, NASA and IBM elected to release Surya as fully open-source software rather than keeping it proprietary. The model weights, training code, and inference pipeline are all available on GitHub under an Apache 2.0 license, while the pre-trained model itself rests on Hugging Face's model hub where researchers can immediately deploy it or fine-tune it for specific needs. This decision aligns with NASA's longstanding tradition of making scientific data and tools freely available but represents a notable shift from IBM's typical approach to commercial AI products.

The rationale behind this choice appears twofold. First, space weather is inherently a global challenge that requires international cooperation, no single nation can effectively monitor and respond to solar threats alone. The United States relies on European satellites like Solar Orbiter for additional observation angles, while European grids depend on American forecasting capabilities. By making Surya open-source, NASA and IBM lowered the barrier for global adoption and encouraged contribution from scientists worldwide. Second, the heliophysics research community is relatively small, fewer than 1,500 active researchers globally, so collaborative improvement was deemed essential for rapid advancement.

Early results confirm this strategy's wisdom. Within three weeks of release, researchers at the Chinese Academy of Sciences had adapted Surya to incorporate data from China's newly launched ASO-S satellite, improving prediction accuracy for events originating from solar longitudes primarily visible from Asia. University of Colorado scientists integrated Surya with their own coronal mass ejection tracking models, creating an end-to-end forecasting system that predicts not just flare occurrence but also the timing and magnitude of potential Earth impacts. Perhaps most notably, amateur solar observers using affordable personal telescopes have begun contributing validation data through a newly created citizen science portal, helping refine the model's accuracy during periods when professional observatories face weather-related limitations.

The open-source release also includes SorayaBench, a standardized benchmarking suite developed specifically for solar forecasting AI. This collection of historical events with verified outcomes allows researchers to objectively compare improvements and ensures that future modifications to Surya maintain or improve its performance characteristics. Each new contribution undergoes rigorous validation against this benchmark before being merged into the main codebase, following practices similar to those used in major open-source software projects like Linux.

Link to section: Broader Implications: Foundation Models for Scientific DiscoveryBroader Implications: Foundation Models for Scientific Discovery

Surya represents a pivotal moment in the application of artificial intelligence to scientific discovery, it's among the first examples of a domain-specific foundation model created explicitly for scientific purposes. Unlike general-purpose models that require extensive fine-tuning for specialized tasks, foundation models are pre-trained on massive domain-specific datasets to develop deep understanding of their subject area. Surya builds on IBM and NASA's earlier Prithvi models for Earth observation but takes this concept further by focusing exclusively on heliophysics.

This approach is proving transformative across scientific disciplines. In medical imaging, researchers at UC San Diego recently developed an AI tool that requires only a fraction of the training data traditionally needed for accurate diagnosis, echoing Surya's efficiency in learning from limited extreme event examples. Similarly, materials scientists used AI to design novel battery composites that could dramatically improve energy storage, condensing years of research into weeks of computational work. These parallel developments suggest a new paradigm where domain-specific foundation models accelerate scientific discovery across multiple fields by learning the underlying physics rather than just statistical correlations.

The success of Surya also validates a significant shift in how scientific computing resources are being allocated. Rather than investing exclusively in larger telescopes or more powerful particle accelerators, research institutions are increasingly directing resources toward developing the AI infrastructure needed to extract maximum value from existing observational data. This trend is evident in projects like the NSF-NVIDIA partnership, which has allocated $152 million to accelerate open-source AI development for scientific research across multiple disciplines.

What makes Surya particularly instructive is how it overcame the "rare event" problem that plagues many scientific AI applications. Major solar flares constitute less than 1 percent of observed solar activity, yet they represent the most consequential events. The Surya team addressed this through innovative data augmentation techniques informed by solar physics, rather than simply oversampling rare events as is common in commercial applications. This physics-informed approach to handling imbalanced data could prove valuable for other scientific domains facing similar challenges, such as earthquake prediction or rare disease diagnosis.

Link to section: Implementation Challenges and Future DevelopmentImplementation Challenges and Future Development

Despite Surya's impressive capabilities, operational implementation faces several practical challenges. The most immediate is integration with existing space weather forecasting infrastructure, much of which relies on legacy systems developed before modern AI became feasible. NOAA's Space Weather Prediction Center, for instance, operates with a mix of 1990s-era mainframe systems and newer commercial applications, creating significant integration hurdles. To address this, NASA and IBM released a bridge application called SolarCast that translates Surya's outputs into the standard IEC 61850 protocol used by most grid operators, eliminating the need for expensive system overhauls.

Computational requirements present another barrier, especially for smaller space agencies and research institutions. While Surya can run on a single NVIDIA H100 GPU during inference, maintaining the full model with training capabilities requires substantial resources. To democratize access, the team developed a quantized version that runs on consumer-grade RTX 4090 cards, sacrificing only 3 percent accuracy. This version has already been deployed by the South African National Space Agency, which previously had limited space weather forecasting capability.

Looking ahead, several development paths promise to extend Surya's capabilities. The most immediate is incorporation of data from newer solar observation platforms, particularly NASA's recently launched Solar Cruiser mission, which provides stereoscopic views of the sun from a unique polar orbit. Researchers are also working on extending Surya's predictive horizon from two hours toward the theoretically possible six hours by incorporating coronal magnetic field measurements from the European Solar Orbiter mission.

Perhaps most exciting is the potential to connect Surya with operational space weather response systems. In a project with the Department of Energy, researchers are developing automated protocols that will allow Surya's predictions to trigger grid protection measures without human intervention when confidence exceeds 95 percent. Similar integrations with satellite constellations could enable autonomous spacecraft reorientation within seconds of a high-confidence prediction. These developments will transform space weather management from a reactive to genuinely proactive discipline.

Link to section: Industry Response and Broader AI ImplicationsIndustry Response and Broader AI Implications

The unveiling of Surya has triggered significant response across multiple industries beyond those directly affected by space weather. The insurance sector, which has increasingly incorporated space weather risk into policies for satellite operators and power utilities, immediately began adjusting their models based on Surya's improved prediction capabilities. Lloyd's of London announced it would revise its space weather risk framework within 90 days, potentially reducing premiums for operators who implement Surya-based protection measures.

The broader AI industry has taken note of Surya's unique approach to scientific modeling. Unlike many headline-grabbing AI systems that focus on language or image generation, Surya demonstrates how foundation models can be effectively applied to highly specialized scientific domains with clear economic benefits. This has prompted several major AI research organizations to reconsider their priorities, Meta announced it would allocate 15 percent of its $29 billion AI infrastructure budget toward scientific foundation models, while the newly formed AI Science Consortium has identified heliophysics as its first target domain.

Critically, Surya's development followed best practices for transparent and verifiable AI research. Every prediction includes confidence metrics and references to the specific features in the input data that contributed to the forecast, addressing concerns about "black box" decision-making that has plagued other AI applications. This transparency proved crucial during verification testing with independent researchers, who were able to confirm the model's physical plausibility by tracing how specific magnetic field configurations led to particular predictions.

Perhaps most significantly, Surya demonstrates that impactful AI doesn't require ever-larger models and datasets. With just 300 million parameters, orders of magnitude smaller than the largest language models, Surya achieves substantial real-world impact by focusing on a specific problem with high-quality, domain-relevant data. This pattern is emerging across successful scientific AI applications and suggests a more sustainable path forward for AI development that prioritizes targeted expertise over raw scale.

Link to section: Concluding Thoughts: A New Era in Space Weather ManagementConcluding Thoughts: A New Era in Space Weather Management

Surya represents more than just a technical achievement, it marks the beginning of a new operational paradigm for managing our relationship with the star that sustains us. For the first time, we have a tool that provides reliable, actionable warnings about solar storms before they reach Earth, transforming space weather from an unpredictable threat to a manageable operational factor. The open-source nature of the project ensures that these benefits will spread rapidly across the global community, with improvements flowing back to enhance the model for everyone.

This development also provides valuable lessons for applying AI to other complex scientific challenges. Surya's success stems not from following generic AI development patterns but from deep integration with domain expertise, solar physicists worked alongside AI researchers throughout development to ensure the model respected physical constraints and focused on operationally relevant outputs. The result is a system that doesn't just predict events but explains them in terms that domain experts can understand and trust.

As Juan Bernabe-Moreno, IBM Research Europe director, aptly described Surya: "Think of this as a weather forecast for space." Just as meteorological forecasts have evolved from crude observations to sophisticated computational models that save lives and property daily, Surya signals the maturation of space weather forecasting into a reliable operational science. The immediate beneficiaries are satellite operators and power grid managers, but in the longer term, this capability will become as fundamental to space operations as weather forecasting is to aviation today.

Looking forward, Surya may well be remembered as the first of many domain-specific foundation models that revolutionize scientific understanding across disciplines. The techniques developed for handling solar data, particularly the physics-informed approach to rare event prediction, could accelerate breakthroughs in fields from climate science to materials engineering. What began as an effort to protect our technology from solar storms may ultimately transform how we approach scientific discovery itself, proving that targeted AI applications can deliver substantial real-world impact without requiring the massive computational resources of general-purpose models.