2018-04-24: Why we need multiple web archives: the case of blog.reidreport.com

This story started in December 2017 with Joy-Ann Reid (of MSNBC) apologizing for "insensitive LGBT blog posts" that she wrote on her blog many years ago when she was a morning radio talk show host in Florida. This apology was, at least in some quarters, (begrudgingly) accepted. Today's update was news that Reid and her lawyers had claimed in December that her blog and/or the Internet Archive's record of the blog had been hacked (Mediaite, The Intercept). Later today, the Internet Archive issued a blog post denying the claim that it was hacked, stating:

This past December, Reid’s lawyers contacted us, asking to have archives of the blog (blog.reidreport.com) taken down, stating that “fraudulent” posts were “inserted into legitimate content” in our archives of the blog. Her attorneys stated that they didn’t know if the alleged insertion happened on the original site or with our archives (Reid’s claim regarding the point of manipulation is still unclear to us).

...

At some point after our correspondence, a robots.txt exclusion request specific to the Wayback Machine was placed on the live blog. That request was automatically recognized and processed by the Wayback Machine and the blog archives were excluded, unbeknownst to us (the process is fully automated). The robots.txt exclusion from the web archive remains automatically in effect due to the presence of the request on the live blog.

Checking the Internet Archive for robots.txt, we can see that on 2018-02-16 blog.reidreport.com had a standard robots.txt page that blocked the admin section of WordPress, but by 2018-02-21 they had a version that blocked all robots, and as of today (2018-04-24) they had a version that specifically blocked only the Internet Archive's crawler ("ia_archiver"). As of about 5pm EDT, the robots.txt file had been removed (probably because of the Internet Archive's blog post calling out the presence of the robots.txt; cf. a similar situation in 2013 with the Conservative Party in the UK), but it may take a while for the Internet Archive to register its absence.
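To make the three robots.txt states concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The file bodies below are illustrative reconstructions, not the actual files served by blog.reidreport.com, and `Googlebot` is just a stand-in for "some other crawler".

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if `user_agent` may fetch `url` under the given robots.txt body."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Assumed contents: illustrative stand-ins for the three states described above.
ADMIN_ONLY = "User-agent: *\nDisallow: /wp-admin/\n"      # 2018-02-16: admin section only
BLOCK_ALL = "User-agent: *\nDisallow: /\n"                # 2018-02-21: all robots
BLOCK_IA_ONLY = "User-agent: ia_archiver\nDisallow: /\n"  # 2018-04-24: Internet Archive only

HOME = "http://blog.reidreport.com/"

for label, body in [("admin only", ADMIN_ONLY),
                    ("block all", BLOCK_ALL),
                    ("block ia_archiver", BLOCK_IA_ONLY)]:
    print(f"{label:18} ia_archiver={allowed(body, 'ia_archiver', HOME)} "
          f"Googlebot={allowed(body, 'Googlebot', HOME)}")
```

Only the third variant singles out the Internet Archive while leaving every other crawler untouched.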

2018-04-25 update: Thanks to Peter Sterne for pointing out that www.blog.reidreport.com/robots.txt still exists, even though blog.reidreport.com/robots.txt does not. They technically can be two different URLs though the convention is for them to canonicalize to the same URL (which is what the Wayback Machine does). HTTP session info provided below, but the summary is that robots.txt is still in effect and the need for other web archives is still paramount.
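As a rough illustration of what "canonicalize to the same URL" means, the sketch below collapses scheme, a leading "www.", and default ports into a single lookup key. The function name `canonical_key` and its rules are my own simplification for this post; the Wayback Machine's actual canonicalization is more elaborate and is not reproduced here.

```python
from urllib.parse import urlsplit

DEFAULT_PORTS = {"http": 80, "https": 443}


def canonical_key(url: str) -> str:
    """Reduce a URL to a scheme-less, www-less key (a simplified, hypothetical rule set)."""
    # Accept scheme-less input such as "www.blog.reidreport.com/robots.txt".
    parts = urlsplit(url if "://" in url else "http://" + url)
    host = (parts.hostname or "").lower()
    if host.startswith("www."):
        host = host[len("www."):]
    # Keep the port only when it is not the default for the scheme.
    if parts.port is not None and parts.port != DEFAULT_PORTS.get(parts.scheme):
        host = f"{host}:{parts.port}"
    key = host + (parts.path or "/")
    if parts.query:
        key += "?" + parts.query
    return key


for u in ["blog.reidreport.com/robots.txt",
          "www.blog.reidreport.com/robots.txt",
          "https://blog.reidreport.com:443/robots.txt"]:
    print(u, "->", canonical_key(u))
```

Under rules like these, all three spellings of the robots.txt URL map to the same key, which is why a file that survives at only one of them can still keep an exclusion in force.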

Until the Internet Archive begins serving blog.reidreport.com again, this is a good time to remind everyone that there are web archives other than the Internet Archive. The screen shot above shows the Memento Time Travel service, which searches about 26 public web archives. In this case, it found mementos (i.e., captures of web pages) in five different web archives: Archive-It (a subsidiary of the Internet Archive), Bibliotheca Alexandrina (the Egyptian Web Archive), the National Library of Ireland, the archive.is on-demand archiving service, and the Library of Congress. For a machine readable service, below I list the TimeMap (list of mementos) generated by our MemGator service; the details aren't important but it is the source of the URLs that will appear next.

Beginning with the original tweets by @Jamie_Maz (2017-11-30 thread, 2018-04-18 thread), I scanned through the screen shots (no URLs were given) and looked for screen shots that had definitive datetimes (most images did not have them). The datetimes are (with ones for which we have evidence in bold, and the ones that we inferred by matching text marked with "(inferred)"):

2005-04-25

2005-07-16

2005-07-21

2006-01-20 (inferred)

2006-06-05

2006-06-13 (inferred)

2006-10-03

2006-12-23

2007-02-21

2008-07-04

2008-10-16

2009-01-15

(update: because of canonicalization errors, some of the URLs are not being excluded; see below)

Most of those dates are pretty early in web archiving times, when the Internet Archive was the only archive commonly available, and many (all?) of the mementos in other web archives were surely originally crawled by the Internet Archive, even if on a contract basis (e.g., for the Library of Congress). Nonetheless, with multiple copies geographically and administratively dispersed throughout the globe, an adversary would have had to hack multiple web archives and alter their contents (cf. lockss.org), or have hacked the original site (blog.reidreport.com) approximately 12 years ago for adulterated pages to have been hosted at all the different web archives. While both scenarios are technically possible, they are extraordinarily unlikely.

While we don't know the totality of the hacking claims, we can offer five archived web pages, hosted at the Library of Congress web archive (webarchive.loc.gov), that corroborate at least some of the claims by @Jamie_Maz.

2006-01-20

30/x Joy seemed very interested in Brokeback Mountain, but wouldn't watch it bc it featured two men hooking up.

She can't understand who is going to see it since she imagines everyone would be turned off by it. pic.twitter.com/ogBnPyDSUF
— Not a bot (@Jamie_Maz) April 18, 2018

Evidence for this tweet can be found at (approximately midway): http://webarchive.loc.gov/all/20060125004941/http://blog.reidreport.com/

2006-06-05

19/x Joy - I am not a gay marriage supporter pic.twitter.com/wbf2QSkx7n
— Not a bot (@Jamie_Maz) April 18, 2018

Evidence for this tweet can be found at (approximately 2/3 down): http://webarchive.loc.gov/all/20060608144033/http://blog.reidreport.com/

2006-06-13

I'm not sure this evidence maps directly to one of the tweets, but it fits the general theme of anti-Charlie Crist: http://webarchive.loc.gov/all/20060615134635/http://blog.reidreport.com/

This memento also exists at archive.is; it is a copy of the Internet Archive's copy but it is not blocked by robots.txt because it is in another archive: http://archive.is/20060615134635/http://blog.reidreport.com/

2006-10-03

10/x Of course there are even more posts about Charlie Crist.

For some strange reason Joy posts a link to an article that claims Crist was involved in gay sex parties with Mark Foley. pic.twitter.com/qNazjxYQ7K
— Not a bot (@Jamie_Maz) April 18, 2018

Evidence for this tweet can be found at (approximately midway): http://webarchive.loc.gov/all/20061010125903/http://blog.reidreport.com/

2008-10-16

7/x Joy calls Crist "Miss Charlie" and again declares his potential wedding to a women is a fraud and a "veep marketing strategy". pic.twitter.com/ZMCbEfURfn
— Not a bot (@Jamie_Maz) November 30, 2017

Evidence for this tweet can be found at (approximately 1/3 down): http://webarchive.loc.gov/all/20081018020856/http://blog.reidreport.com/
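The memento URLs cited in this post embed a 14-digit capture timestamp followed by the original URL. Here is a small sketch that splits them apart; the regular expression is my own reading of the URL shapes that appear in this post (webarchive.loc.gov/all/..., archive.is/..., web.archive.org/web/...), not an official grammar, and it assumes the timestamps are UTC.

```python
import re
from datetime import datetime, timezone

# host, an optional collection segment ("all", "web"), 14-digit timestamp, original URL
URI_M = re.compile(r"^https?://[^/]+/(?:[^/]+/)?(\d{14})/(.+)$")


def split_memento(uri_m: str):
    """Return (capture datetime, original URL) for a Wayback-style memento URL."""
    match = URI_M.match(uri_m)
    if match is None:
        raise ValueError(f"not a recognized memento URL: {uri_m}")
    captured = datetime.strptime(match.group(1), "%Y%m%d%H%M%S")
    return captured.replace(tzinfo=timezone.utc), match.group(2)


for uri in ["http://webarchive.loc.gov/all/20060125004941/http://blog.reidreport.com/",
            "http://archive.is/20060615134635/http://blog.reidreport.com/",
            "http://webarchive.loc.gov/all/20081018020856/http://blog.reidreport.com/"]:
    captured, original = split_memento(uri)
    print(captured.isoformat(), original)
```

The extracted datetimes line up with entries in the TimeMap at the end of this post; for example, 20060125004941 corresponds to the memento dated "Wed, 25 Jan 2006 00:49:41 GMT".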

In summary, of the many examples that @Jamie_Maz provides, I can find five copies in the Library of Congress's web archive. These crawls were probably performed on behalf of the Library of Congress by the Internet Archive (for election-based coverage); even though there are many different (and independent) web archives now, in 2006 the Internet Archive was pretty much the only game in town. Even though these mementos are not independent observations, there is no plausible scenario for these copies to have been hacked in multiple web archives or at the original blog 10+ years ago. There may be additional evidence in the other web archives, but I haven't exhaustively searched them.

We don't know the full details of what Reid's lawyers alleged, so perhaps there are details that we don't know. But the analysis from the Internet Archive crawl engineers, plus the evidence in separate web archives, suggests that the claim has no merit.

The case of blog.reidreport.com is another example of why we need multiple web archives.

--Michael

Thanks to Prof. Michele Weigle and John Berlin for bringing this issue to my attention and uncovering some of the examples.

Memento TimeMap for blog.reidreport.com:

% curl -i https://memgator.cs.odu.edu/timemap/link/http://blog.reidreport.com/
HTTP/1.1 200 OK
Access-Control-Allow-Origin: *
Access-Control-Expose-Headers: Link, Location, X-Memento-Count, X-Generator
Content-Type: application/link-format
Date: Tue, 24 Apr 2018 20:39:32 GMT
X-Generator: MemGator:1.0-rc7
X-Memento-Count: 174
Transfer-Encoding: chunked

<http:></http:>; rel="original",
<https:></https:>; rel="self"; type="application/link-format",
<http:></http:>; rel="first memento"; datetime="Tue, 13 Dec 2005 06:37:57 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 27 Dec 2005 07:11:34 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 09 Jan 2006 23:38:24 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 11 Jan 2006 22:17:38 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 13 Jan 2006 22:17:54 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 17 Jan 2006 04:00:21 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 25 Jan 2006 00:49:41 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 30 Jan 2006 21:27:07 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 07 Feb 2006 04:37:23 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 14 Feb 2006 02:11:36 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 02 Jun 2006 12:01:19 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 08 Jun 2006 14:40:33 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 15 Jun 2006 13:46:35 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 15 Jun 2006 13:46:35 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 29 Sep 2006 09:35:09 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 10 Oct 2006 12:59:03 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 19 Oct 2006 21:33:57 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 19 Nov 2006 12:46:09 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 19 Dec 2006 12:28:32 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 02 Jan 2007 04:08:34 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 14 Jan 2007 01:52:13 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 13 May 2007 09:35:53 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 17 Dec 2007 22:54:56 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 13 Jan 2008 23:01:46 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 14 Feb 2008 15:34:40 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 29 Aug 2008 14:53:25 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 04 Sep 2008 17:09:37 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 13 Sep 2008 11:06:33 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 22 Sep 2008 19:57:42 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 26 Sep 2008 15:47:52 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 02 Oct 2008 22:37:53 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 09 Oct 2008 21:02:02 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 18 Oct 2008 02:08:56 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 26 Oct 2008 03:28:23 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 01 Nov 2008 23:14:44 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 07 Nov 2008 19:08:50 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 14 Nov 2008 19:29:33 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 29 Nov 2008 22:26:46 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 07 Aug 2009 19:22:02 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 06 Sep 2009 03:43:48 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 23 Nov 2009 07:26:35 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 23 Nov 2009 07:26:35 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 08 Jun 2010 13:09:17 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 08 Sep 2010 15:06:01 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 08 Sep 2010 15:06:01 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 17 Oct 2010 18:08:28 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 21 Oct 2010 20:44:35 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 23 Oct 2010 14:39:57 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 23 Oct 2010 14:39:57 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 29 Oct 2010 01:03:31 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 04 Nov 2010 23:39:18 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 11 Nov 2010 20:52:48 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 18 Nov 2010 12:52:39 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 25 Nov 2010 13:04:03 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 02 Dec 2010 21:13:57 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 03 Dec 2010 22:33:09 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 03 Dec 2010 22:33:09 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 04 Dec 2010 13:00:37 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 10 Dec 2010 22:04:16 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 10 Dec 2010 22:04:16 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 18 Dec 2010 02:25:03 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 25 Dec 2010 01:14:55 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 01 Jan 2011 10:29:29 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 02 Jan 2011 12:42:25 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 10 Jan 2011 19:21:23 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 15 Jan 2011 14:10:29 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 29 Jan 2011 08:10:21 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 31 Jan 2011 23:54:56 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 02 Feb 2011 02:23:38 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 05 Feb 2011 15:35:52 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 08 Feb 2011 00:21:06 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 19 Feb 2011 17:35:53 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 04 Mar 2011 21:33:16 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 06 Mar 2011 07:40:27 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 07 Mar 2011 14:47:06 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 10 Mar 2011 14:05:43 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 11 Mar 2011 19:27:05 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 21 Mar 2011 17:02:36 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 24 Mar 2011 21:38:16 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 29 Mar 2011 05:31:24 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 30 Mar 2011 17:00:39 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 06 Apr 2011 22:31:19 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 14 Apr 2011 01:19:42 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 16 Apr 2011 10:08:48 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 20 Apr 2011 15:45:44 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 27 Apr 2011 20:17:27 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 04 May 2011 13:59:20 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 20 May 2011 04:52:29 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 27 May 2011 18:39:51 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 02 Jun 2011 13:53:15 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 08 Jun 2011 09:00:12 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 10 Jun 2011 11:36:20 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 15 Jun 2011 13:11:17 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 22 Jun 2011 11:38:49 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 02 Jul 2011 04:01:34 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 06 Jul 2011 23:17:37 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 13 Jul 2011 17:30:24 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 21 Jul 2011 09:26:04 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 28 Jul 2011 20:50:32 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 29 Jul 2011 09:24:10 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 04 Aug 2011 05:48:17 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 05 Aug 2011 15:26:39 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 11 Aug 2011 05:19:14 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 11 Aug 2011 05:24:15 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 17 Aug 2011 22:56:34 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 24 Aug 2011 09:54:45 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 10 Sep 2011 22:09:09 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 27 Nov 2011 12:49:34 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 28 Nov 2011 19:08:33 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 16 Feb 2012 19:11:31 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 10 Aug 2012 23:51:03 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 18 Aug 2012 05:12:23 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 24 Aug 2012 00:36:55 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 30 Aug 2012 03:12:37 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 05 Sep 2012 21:26:20 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 20 Sep 2012 04:39:05 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 28 Sep 2012 20:54:35 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 05 Oct 2012 09:02:12 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 12 Oct 2012 14:26:52 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 06 Nov 2012 21:45:50 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 13 Nov 2012 21:34:24 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 22 Nov 2012 03:51:16 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 28 Nov 2012 01:26:55 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 06 Dec 2012 07:38:47 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 08 Dec 2012 10:48:25 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 09 Dec 2012 11:25:53 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 12 Dec 2012 15:21:12 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 19 Dec 2012 20:15:42 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 22 Dec 2012 08:35:28 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 28 Dec 2012 06:20:56 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 03 Jan 2013 13:19:28 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 04 Jan 2013 12:19:10 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 05 Jan 2013 08:38:57 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 09 Jan 2013 09:44:17 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 16 Jan 2013 23:39:57 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 23 Jan 2013 22:23:46 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 08 Mar 2013 15:08:01 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 09 Mar 2013 02:33:50 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 20 Apr 2013 08:26:37 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 20 Apr 2013 09:07:21 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 20 Apr 2013 19:37:56 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 22 Apr 2013 07:37:07 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 08 Jun 2013 12:18:08 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 07 Aug 2013 09:33:21 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 08 Sep 2013 14:42:36 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 28 Sep 2013 00:11:44 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 19 Oct 2013 03:40:11 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 20 Oct 2013 00:51:13 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 20 Oct 2013 08:19:55 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 01 Nov 2013 00:17:23 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 08 Dec 2013 03:22:37 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 09 Dec 2013 19:11:58 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 20 Dec 2013 17:01:05 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 24 Dec 2013 22:19:04 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 04 Jan 2014 20:17:27 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 10 Jan 2014 10:11:50 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 25 Jan 2014 08:11:53 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 25 Feb 2014 00:03:47 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 08 Mar 2014 21:21:13 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 08 Jun 2014 12:10:32 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 09 Sep 2014 05:31:10 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 08 Aug 2015 05:49:42 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 16 Feb 2018 09:14:05 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 17 Feb 2018 23:51:22 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 18 Feb 2018 20:00:12 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 19 Feb 2018 20:35:51 GMT",
<http:></http:>; rel="memento"; datetime="Tue, 20 Feb 2018 21:48:48 GMT",
<http:></http:>; rel="memento"; datetime="Wed, 21 Feb 2018 22:02:48 GMT",
<http:></http:>; rel="memento"; datetime="Thu, 22 Feb 2018 22:23:22 GMT",
<http:></http:>; rel="memento"; datetime="Fri, 23 Feb 2018 19:59:12 GMT",
<http:></http:>; rel="memento"; datetime="Sat, 24 Feb 2018 21:03:58 GMT",
<http:></http:>; rel="memento"; datetime="Sun, 25 Feb 2018 18:56:18 GMT",
<http:></http:>; rel="memento"; datetime="Mon, 26 Feb 2018 19:37:17 GMT",
<http:></http:>; rel="last memento"; datetime="Tue, 27 Feb 2018 19:34:59 GMT",
<https:></https:>; rel="timemap"; type="application/link-format",
<https:></https:>; rel="timemap"; type="application/json",
<https:></https:>; rel="timemap"; type="application/cdxj+ors",
<https:></https:>; rel="timegate"
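For readers who want to process a TimeMap like the one above, here is a minimal sketch of parsing the link-format body and counting mementos per archive host. The regular expressions are a simplification that handles only the quoted `name="value"` parameters seen in this response, and the hosts in `SAMPLE` (archive.example, other-archive.example, memgator.example) are placeholders I made up, since the listing above does not preserve the memento URLs.

```python
import re
from collections import Counter
from email.utils import parsedate_to_datetime
from urllib.parse import urlsplit

LINK = re.compile(r'<([^>]*)>((?:\s*;\s*[\w-]+="[^"]*")*)')
PARAM = re.compile(r'([\w-]+)="([^"]*)"')


def parse_timemap(body: str):
    """Return (datetime, memento URL) pairs for every link whose rel includes "memento"."""
    mementos = []
    for target, raw_params in LINK.findall(body):
        params = dict(PARAM.findall(raw_params))
        # rel may be compound, e.g. "first memento" or "last memento".
        if "memento" in params.get("rel", "").split():
            mementos.append((parsedate_to_datetime(params["datetime"]), target))
    return mementos


# Placeholder hosts; the datetimes are copied from the TimeMap above.
SAMPLE = '''<http://blog.reidreport.com/>; rel="original",
<http://archive.example/20051213063757/http://blog.reidreport.com/>; rel="first memento"; datetime="Tue, 13 Dec 2005 06:37:57 GMT",
<http://other-archive.example/20060125004941/http://blog.reidreport.com/>; rel="memento"; datetime="Wed, 25 Jan 2006 00:49:41 GMT",
<http://archive.example/20180227193459/http://blog.reidreport.com/>; rel="last memento"; datetime="Tue, 27 Feb 2018 19:34:59 GMT",
<https://memgator.example/timegate/http://blog.reidreport.com/>; rel="timegate"
'''

mementos = parse_timemap(SAMPLE)
per_archive = Counter(urlsplit(url).hostname for _, url in mementos)
print(len(mementos), dict(per_archive))
```

Grouping by host is the quickest way to see how many distinct archives hold copies, which is the point of this post.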

2018-04-25 update: As noted above, Peter Sterne brought to my attention that the non-standard URL of www.blog.reidreport.com/robots.txt still exists (and is blocking "ia_archiver") even though the more standard blog.reidreport.com/robots.txt is 404.

Another 2018-04-25 update: The NYT has covered the story ("MSNBC Host Joy Reid Blames Hackers for Anti-Gay Blog Posts, but Questions Mount"), and there was an interview with Reid's computer security expert ("Should We Believe Joy Reid’s Blog Was Hacked? This Security Consultant Says We Should"), Jonathan Nichols.

I embed a statement from Nichols (released by Erik Wemple), and a tweet from Nichols clarifying that they were not suggesting that the Wayback Machine's mementos were hacked, but rather that the hacked blog was crawled by the Internet Archive.

This is where it's important to note that there may be a discrepancy between the posts that Nichols is concerned with and those that @Jamie_Maz surfaced. There is (semi-)independent evidence of @Jamie_Maz's pages, with the ultimate implication that for those pages to have been the result of a hack, blog.reidreport.com would have had to have been hacked as many as 12 years ago -- and for nobody to have noticed at the time.

Reid (& Nichols) could always unblock the Internet Archive and share the evidence of the hack.

Here's the statement of security consultant Jonathan Nichols regarding the claims of blog-hacking by MSNBC's Joy Reid. pic.twitter.com/wGAui8Mfa5
— ErikWemple (@ErikWemple) April 25, 2018

1) WayBack was hacked
2) Joy was hacked
3) we said "Yo! Does your hack look like our hack!?"
4) PS: That's literally the industry standard (I've personally done it plenty of times)
5) We THEN got new data that showed it wasn't a hack of any archive.
— Jonathan Nichols (@wvualphasoldier) April 25, 2018

Yet another 2018-04-25 update: Apparently there are some holes in the http vs. https canonicalization wrt robots.txt blockage, allowing some of the posts to surface. Here's an example (via @YanceyMc):
https://web.archive.org/web/20060225041734/https://blog.reidreport.com/2005/10/harriet-miers-and-lesbian-hair-check.html

that page was captured in 2012; here's a version captured in 2006 (page originally authored in 2005; according to blogger):https://t.co/hrXwyC9wGH

here's a copy of that copy in @archiveis:https://t.co/kRgDN7nY4f https://t.co/103n66HKe1
— Michael L. Nelson (@phonedude_mln) April 25, 2018

Also, @wvualphasoldier deleted his tweets then protected his account, so that's the reason the above embed no longer formats correctly.

Yet, Yet Another 2018-04-25 update:

Thanks to Prof. Weigle and Mat Kelly for providing examples of some of the URLs that are slipping through the robots.txt exclusion.

Here's one: https://web.archive.org/web/20060805055643/https://blog.reidreport.com

and another: https://web.archive.org/web/20050728132003/https://blog.reidreport.com:443/

Which has the following information that I thought I saw in the original @Jamie_Maz tweets but now I can't find it, so perhaps I'm misremembering. It certainly fits the overall theme.
