4.9B
4.9B
Nov 4, 2011
11/11
by
Internet Archive
Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
Topic: webcrawl
8.3B
8.3B
Dec 16, 2004
12/04
by
Internet Archive
The Internet Archive offers over 20,000,000 freely downloadable books and texts. There is also a collection of 2.3 million modern eBooks that may be borrowed by anyone with a free archive.org account. Borrow a Book Books on Internet Archive are offered in many formats, including DAISY files intended for print disabled people. In addition to the collections here, print disabled people may access a large collection of modern books provided as encrypted DAISY files on...
Topics: Texts, Kindle, Ebook, Nook, Books, Documents
2B
2.0B
Apr 8, 2011
04/11
by
Internet Archive
Large-scale web harvests and national domain crawls performed for National Libraries, National Archives, preservation partners, research initiatives, and as part of special projects and custom crawling and research services.
Topic: ccs
3.1B
3.1B
Feb 26, 2005
02/05
by
Internet Archive
You are invited to view or upload your videos to the Community collection. These thousands of videos were contributed by Archive users and community members. These videos are available for free download. Please select a Creative Commons License during upload so that others will know what they may (or may not) do with with your video. Click here to upload your video !
Topic: Moving Images
2.3B
2.3B
Jan 18, 2005
01/05
by
Internet Archive
Texts contributed by the community. Click here to contribute your book ! For more information and how-to please see help.archive.org/hc/en-us/articles/360002360111-Uploading-A-Basic-Guide Uploaders, please note: Archive.org supports metadata about items in just about any language so long as the characters are UTF8 encoded Find books by language: Afar Books Afrikaans Books Akan Books Albanian Books Arabic Books Armenian Books Aymara Books Azerbaijan Books Balochi Books Bambara Books Bangla Books...
Topic: Texts
155M
155M
May 9, 2006
05/06
by
Internet Archive
The Open Source Software Collection includes computer programs and/or data which are licensed under an Open Source Initiative or Free Software license, or is public domain . In general, items in this collection should be software for which the source code is freely available and able to be used and distributed without undue restrictions, and/or computer data which conforms to an openly published format.
Topics: software, public domain, open source, opensource, oss, free software, gpl, gnu, public domain...
89.3M
89M
Dec 19, 2017
12/17
by
Internet Archive Web Group
A series of open web crawls targeting journal articles, technical memos, essays, datasets, and other research publications. This collection contains WARC and CDX files that end up in Wayback ( https://web.archive.org ). See also bibliographic metadata corpuses at https://archive.org/details/ia_biblio_metadata
240.9M
241M
Feb 26, 2005
02/05
by
Internet Archive
Feature films, shorts , silent films and trailers are available for viewing and downloading. Enjoy! View a list of all the Feature Films sorted by popularity . Do you want to post a feature film? First, figure out if it's in the Public Domain. Read this FAQ about determining if something is PD. If you're still not sure, post a question to the forum below with as much information about the movie as possible. One of our users might have relevant information.
Topic: Moving Images
56.7M
57M
Nov 15, 2013
11/13
by
Internet Archive
584.3M
584M
Jun 16, 2005
06/05
by
Internet Archive Canada
Welcome to the Canadian Libraries page. The Toronto scanning centre was established in 2004 on the campus of the University of Toronto . From its humble beginnings, Internet Archive Canada has worked with more 250 institutions, in providing their unique material(s) with open access and sharing these collections the world over. From the Archives of the Sisters of Service to the University of Alberta, IAC has digitized more than 600,000 unique texts as of September 2019. Many texts/collections...
Topic: Texts
86.2M
86M
Nov 15, 2013
11/13
by
Internet Archive
94M
94M
Nov 15, 2013
11/13
by
Internet Archive
88.6M
89M
Nov 15, 2013
11/13
by
Internet Archive
106.6M
107M
Nov 15, 2013
11/13
by
Internet Archive
59.4M
59M
Jan 22, 2014
01/14
by
Internet Archive
71.4M
71M
Nov 15, 2013
11/13
by
Internet Archive
69.9M
70M
Nov 15, 2013
11/13
by
Internet Archive
56.3M
56M
Nov 15, 2013
11/13
by
Internet Archive
66.6M
67M
Jan 22, 2014
01/14
by
Internet Archive
73.8M
74M
Nov 15, 2013
11/13
by
Internet Archive
51.5M
52M
Jan 22, 2014
01/14
by
Internet Archive
53.2M
53M
Jan 22, 2014
01/14
by
Internet Archive
51.3M
51M
Jan 22, 2014
01/14
by
Internet Archive
58M
58M
Nov 15, 2013
11/13
by
Internet Archive
40.7M
41M
Jan 22, 2014
01/14
by
Internet Archive
41M
41M
Jan 22, 2014
01/14
by
Internet Archive
38.2M
38M
Jan 22, 2014
01/14
by
Internet Archive
41.4M
41M
Jan 22, 2014
01/14
by
Internet Archive
40.4M
40M
Jan 22, 2014
01/14
by
Internet Archive
43M
43M
Jan 22, 2014
01/14
by
Internet Archive
38.6M
39M
Jan 22, 2014
01/14
by
Internet Archive
31.3M
31M
Jan 22, 2014
01/14
by
Internet Archive
36M
36M
Jan 22, 2014
01/14
by
Internet Archive
50.4M
50M
Jan 22, 2014
01/14
by
Internet Archive
43.4M
43M
Jan 22, 2014
01/14
by
Internet Archive
32.9M
33M
Jan 22, 2014
01/14
by
Internet Archive
33M
33M
Jan 22, 2014
01/14
by
Internet Archive
38.6M
39M
Jan 22, 2014
01/14
by
Internet Archive
27.2M
27M
Jan 22, 2014
01/14
by
Internet Archive
33.1M
33M
Jan 22, 2014
01/14
by
Internet Archive
32.9M
33M
Jan 22, 2014
01/14
by
Internet Archive
34.3M
34M
Jan 22, 2014
01/14
by
Internet Archive
32.3M
32M
Jan 22, 2014
01/14
by
Internet Archive
29.5M
30M
Jan 22, 2014
01/14
by
Internet Archive
33M
33M
Jan 22, 2014
01/14
by
Internet Archive
32.9M
33M
Jan 22, 2014
01/14
by
Internet Archive
30M
30M
Jan 22, 2014
01/14
by
Internet Archive
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl413.us.archive.org:survey from Sat Dec 21 07:47:29 PST 2013 to Sat Dec 21 00:52:11 PST 2013.
Topic: crawldata
24.3M
24M
Jan 22, 2014
01/14
by
Internet Archive
29.8M
30M
Jan 22, 2014
01/14
by
Internet Archive