🕐 --:--
-- --
عاجل
⚡ عاجل: كريستيانو رونالدو يُتوّج كأفضل لاعب كرة قدم في العالم ⚡ أخبار عاجلة تتابعونها لحظة بلحظة على خبر ⚡ تابعوا آخر المستجدات والأحداث من حول العالم
⌘K
AI مباشر | -- مشاهد مباشر
905,926 مقال 401 مصدر نشط 228 قناة مباشرة 4,804 خبر اليوم
آخر تحديث: منذ 4 ثواني

News outlets like NYT and USA Today are blocking the Internet Archive’s Wayback Machine to prevent AI training models from using their content

تكنولوجيا
فورتشن العربية
2026/04/15 - 14:01 513 مشاهدة
تحليل ذكي | AI Editorial Analysis

What some consider to be the digital library of Alexandria is in danger of losing valuable scrolls.

Major media outlets are blocking the Internet Archive’s Wayback Machine from saving web pages to prevent AI giants from training models on snapshots of old articles.

Wired reported that 23 news organizations, including USA Today and the New York Times, are among the 241 sites denying Internet Archive’s web crawler access to their articles.

هذا الخبر من فورتشن العربية. خبر يقدم أدوات ذكاء اصطناعي للتلخيص والترجمة والاستماع.

What some consider to be the digital library of Alexandria is in danger of losing valuable scrolls. Major media outlets are blocking the Internet Archive’s Wayback Machine from saving web pages to prevent AI giants from training models on snapshots of old articles.

Wired reported that 23 news organizations, including USA Today and the New York Times, are among the 241 sites denying Internet Archive’s web crawler access to their articles. It’s not personal—some outlets still use the Archive in their reporting—it’s about the looming threat of AI:

  • Tech companies can skirt copyright laws by using the Wayback Machine as a workaround for training language models on their content (including recipes, probably).
  • Mark Graham, the director of the Wayback Machine, emphasizes that the digital archive has controls to limit abuse of AI automation and prevent large-scale data extraction.

Publishers can archive their material, but a third party maintains a more incorruptible version of stories that can hold outlets accountable when it’s revised after publication.

Nothing new: Last year, Reddit barred the Wayback Machine from data scraping for similar AI concerns. The archive also lost a slew of information when federal government websites were deleted.

Still working: Graham is reportedly in talks to regain access to the material, while more than 100 media workers signed a letter supporting Wayback.—DL

This report was originally published by Morning Brew.

This story was originally featured on Fortune.com

المصدر: فورتشن العربية | Source: فورتشن العربية

ملاحظة تحريرية | Editorial Note: نُشر هذا المقال في الأصل بواسطة فورتشن العربية. خبر (Khabr) هي منصة إعلامية أردنية مرخّصة تعمل بالذكاء الاصطناعي. نضيف قيمة تحريرية من خلال: تحليل ذكي للأخبار، ملخصات تلقائية، رواية صوتية بالذكاء الاصطناعي، ترجمة متعددة اللغات، وتدقيق الحقائق. هدفنا جعل الأخبار أكثر وضوحاً وسهولةً للقارئ العربي.

This article was originally published by فورتشن العربية. Khabr is a licensed Jordanian AI-powered news platform (Registration #82086). We add editorial value through: AI-powered news analysis, automated summaries, AI audio narration, multi-language translation (Arabic, English, French, Turkish), and AI fact-checking. Our mission is to make news more accessible and understandable for Arabic-speaking audiences worldwide.

مشاركة:

المزيد عن تكنولوجيا | More on Technology

هذا الخبر ضمن تغطية خبر لقسم تكنولوجيا. نقدّم لك تحليلات ذكية وملخصات يومية لأهم الأخبار من مصادر موثوقة متعددة. المصدر: فورتشن العربية. يوجد 6 مقالات مرتبطة بهذا الموضوع.

This article is part of Khabr's coverage of Technology. We provide AI-powered analysis, summaries, and multi-source aggregation to keep you informed. Source: فورتشن العربية. Tags: AI, Internet Archive, corporate crisis.

مقالات ذات صلة

AI
يا هلا! اسألني أي شي 🎤
🔍
FREE Free 1GB Internet + Free International Calls

$1 trial — eSIM in 190+ countries — No roaming charges

Download Free