Skip to main content

Arquivo.pt: the Portuguese web-archive

Arquivo.pt - The Portuguese web-archive (PWA) is the national Web archive of Portugal. Its mission is to periodically archive contents of national interest available on the Web, storing and preserving for future generations information of historical relevance. It is a service of the Foundation for Science and Technology (FCT).


rss RSS

33,017
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Arquivo.pt: the Portuguese web-archive
web

eye 19,050

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 13,035

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 27,452

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 14,015

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 10,911

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 30 June 2011 and 5 August 2011 mainly from .PT domain. The AWP11 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP11 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 12,366

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 30 June 2011 and 5 August 2011 mainly from .PT domain. The AWP11 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP11 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 11,895

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
First collection FAWP. Without Deduplicator.
Topics: Frequent crawl of news media from Portuguese web, Portuguese Web Archive, Portuguese online...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Sixth collection FAWP. With Deduplicator.
Topics: Frequent crawl of news media from Portuguese web, Portuguese Web Archive, Portuguese online...
Arquivo.pt: the Portuguese web-archive
web

eye 14,280

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 16,754

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 16,783

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 19,564

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Arquivo.pt: the Portuguese web-archive
web

eye 2,154

favorite 0

comment 0

Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Sixth collection FAWP. With Deduplicator.
Topics: Frequent crawl of news media from Portuguese web, Portuguese Web Archive, Portuguese online...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...