US Researchers Launch A DeepSeek Competitor

A small team of researchers at Stanford and Washington Universities have created an advanced and very significant AI reasoning model, named s1, for an incredibly low cost of under $50. 

This is highly significant in an industry where developing similar models takes many millions of dollars in resource and infrastructure costs at a time of  growing competition in the AI reasoning field.

For the purpose of comparion, Chinese startup DeepSeek recently made a big impact with its own reasoning model, R1, which the company claims to have been developed for just $6 million.

The s1 model can complete complex reasoning tasks, and has performed in similar ways to OpenAI’s o1 and DeepSeek’s R1 with maths and coding. However, critics are questioning the accuracy of DeepSeek’s claims, and also expressed their concerns regarding the safety and security of its models.

Low Cost Of s1’s Development

This process involves training s1 to mimic the reasoning abilities of an existing AI model, specifically, Google’s Gemini 2.0 Flash Thinking experimental model. By using a curated dataset of 1,000 questions and answers, paired with reasoning traces from the Gemini model, s1 learned how to arrive at accurate solutions in a fraction of the time and cost compared to traditional methods.

According to the researchers, training s1 took just 26 minutes using 16 Nvidia H100 GPUs, costing just $20 in total.

The researchers used what they call Supervised Fine-Tuning (SFT), a method that involves guiding the model with explicit instructions to accelerate the learning process. One particularly interesting development during s1’s creation was the introduction of a “wait” instruction, which helped improve its accuracy. By incorporating pauses into the model’s reasoning process, the researchers found that s1 was able to double-check its responses, often correcting errors and leading to more accurate conclusions.

The researchers behind s1 hope their work will drive open innovation, making powerful reasoning models more accessible to the global community and accelerating advancements in AI technology for the benefit of society. 

However, a higher level of investment may still be necessary to push the envelope of AI innovation. 

The shrort-cut methods used by s1 and R1 (sometimes referred to as distillation) are demonstrably a good method for cheaply re-creating an AI model’s capabilities, but they don’t create new AI models vastly better than what is already available.

arXiv   |    I-HLS    |   Interesting Engineering     |  Tech Xplore   |  Mashable  | Tech Crunch   |   Yahoo

Image: Igor Kutyaev

You Might Also Read: 

A History Of Artificial Intelligence: Its Current & Future Development:


If you like this website and use the comprehensive 6,500-plus service supplier Directory, you can get unrestricted access, including the exclusive in-depth Directors Report series, by signing up for a Premium Subscription.

  • Individual £5 per month or £50 per year. Sign Up
  • Multi-User, Corporate & Library Accounts Available on Request

Cyber Security Intelligence: Captured Organised & Accessible


 

« Thai Police Arrest Russian Hackers
Business Interruption Is The #1 Cyber Risk »

CyberSecurity Jobsite
Perimeter 81

Directory of Suppliers

The PC Support Group

The PC Support Group

A partnership with The PC Support Group delivers improved productivity, reduced costs and protects your business through exceptional IT, telecoms and cybersecurity services.

Authentic8

Authentic8

Authentic8 transforms how organizations secure and control the use of the web with Silo, its patented cloud browser.

CYRIN

CYRIN

CYRIN® Cyber Range. Real Tools, Real Attacks, Real Scenarios. See why leading educational institutions and companies in the U.S. have begun to adopt the CYRIN® system.

DigitalStakeout

DigitalStakeout

DigitalStakeout enables cyber security professionals to reduce cyber risk to their organization with proactive security solutions, providing immediate improvement in security posture and ROI.

Resecurity

Resecurity

Resecurity is a cybersecurity company that delivers a unified platform for endpoint protection, risk management, and cyber threat intelligence.

National Crime Agency (NCA) - United Kingdom

National Crime Agency (NCA) - United Kingdom

The NCA's Cyber Crime Unit focuses on critical cyber incidents in the UK as well as longer-term activity against the criminals and the services on which they depend.

PubNub

PubNub

PubNub enables developers to build secure realtime Mobile, Web, and IoT Apps.

Security Research Labs (SRLabs)

Security Research Labs (SRLabs)

Security Research Labs is a Berlin-based hacking research collective and consulting think tank.

EverC

EverC

EverC (formerly EverCompliant) is a leading provider of cyber intelligence that allows acquiring banks and payment service providers (PSP) to manage cyber risk.

Ideagen

Ideagen

Ideagen provides information management, safety, risk and compliance software solutions that allow organisations to achieve operational excellence, regulatory compliance and reduce risk.

PlainID

PlainID

PlainID provides IAM teams with a simple and intuitive means to control their organization’s entire authorization process.

Immersive Labs

Immersive Labs

Immersive Labs have created a kinesthetic learning platform which identifies gaps in your teams cyber skills.

Cybersecurity Collaborative

Cybersecurity Collaborative

CyberSecurity Collaborative is a forum for CISOs to share information that will collectively make us stronger, and better equipped to protect our enterprises from those seeking to damage them.

Cybersecurity Coalition

Cybersecurity Coalition

The mission of the Cybersecurity Coalition is to bring together leading companies to help policymakers develop consensus-driven policy solutions to achieve improvements in cybersecurity.

Sum&Substance (Sumsub)

Sum&Substance (Sumsub)

Sum&Substance is a developer of remote verification solutions. Our technology allows online services around the world to meet regulatory requirements, prevent fraud and enhance customer confidence.

Corellium

Corellium

Corellium are dedicated to supporting our peers in the ARM community who seek to build more secure, performant, and accessible software and devices.

International Association of Financial Crimes Investigators (IAFCI)

International Association of Financial Crimes Investigators (IAFCI)

International Association of Financial Crimes Investigators provides services and information about financial fraud, fraud investigation and fraud prevention.

Darktrace

Darktrace

Darktrace is a global leader in cybersecurity AI, delivering complete AI-powered solutions in its mission to free the world of cyber disruption.

Positka FSI Pte Ltd

Positka FSI Pte Ltd

Positka, being a Splunk Singapore partner, provides Splunk & Phantom Services, Cybersecurity & Risk Management, Analytics & Big Data, Lean Process Optimization, and Managed Security Services.

SecureFlag

SecureFlag

SecureFlag is dedicated to enhancing secure coding across all technical profiles within the Software Development Lifecycle.

CyPro

CyPro

CyPro is a cyber security expert firm that specialises in providing cyber security services tailored for high-growth companies at every stage of their journey.