DeepSeek, a Chinese AI laboratory, is suspected of using unauthorized outputs from Google's (GOOGL, Financial) Gemini model to train its upgraded reasoning model, R1-0528. The recently released model has performed strongly across a range of benchmarks, but DeepSeek has not disclosed the sources of its training data.
AI developer Sam Paech noted on X that R1-0528's vocabulary and sentence structure closely resemble those of Google's latest Gemini 2.5 Pro, suggesting Gemini outputs may have been used without authorization. Similarly, the pseudonymous developer behind SpeechMap observed that the model's reasoning traces read like content produced by Gemini, raising further questions about the data's provenance.
This is not the first such accusation against DeepSeek. In December 2024, its V3 model drew suspicion of having been trained on OpenAI chat logs after it repeatedly identified itself as ChatGPT. OpenAI said it had found indications that DeepSeek may have used distillation, a technique in which outputs from a stronger language model are used to train another model, in breach of OpenAI's terms of service. Microsoft (MSFT), an OpenAI partner, also detected significant data exfiltration linked to DeepSeek in late 2024.
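To make the distillation technique described above concrete, here is a minimal, self-contained sketch. The `teacher_model` function is a hypothetical stand-in for API calls to a stronger model; in a real pipeline, its responses would be collected at scale and used as supervised fine-tuning targets for the smaller "student" model. This is an illustration of the general idea, not a depiction of any lab's actual pipeline.

```python
def teacher_model(prompt: str) -> str:
    """Hypothetical stand-in for queries to a stronger model's API."""
    canned = {
        "What is 2 + 2?": "2 + 2 equals 4.",
        "What is the capital of France?": "The capital of France is Paris.",
    }
    return canned.get(prompt, "I'm not sure.")

def build_distillation_dataset(prompts):
    """Turn each teacher response into a training target for the student.

    The resulting prompt/target pairs would then be fed to a standard
    supervised fine-tuning loop for the smaller model.
    """
    return [{"prompt": p, "target": teacher_model(p)} for p in prompts]

dataset = build_distillation_dataset(
    ["What is 2 + 2?", "What is the capital of France?"]
)
for example in dataset:
    print(example["prompt"], "->", example["target"])
```

Because the student learns from the teacher's outputs rather than raw web text, distilled models can inherit the teacher's phrasing and self-descriptions, which is why stylistic overlap is treated as circumstantial evidence in these disputes.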
In response, OpenAI and Google are tightening safeguards against unauthorized use of their models' outputs. However, the growing volume of AI-generated content online has made filtering training data increasingly difficult.