Poster: PiRSquared Date: Feb 26, 2015 10:31am
Forum: forums Subject: Re: Why are simple images missing from very recent archived pages?

I think Heritrix sometimes misses resources like images and CSS, although in my experience recent crawls seem to be more complete. I checked again and those images are actually being served from web.archive.org, so maybe they weren't archived before but are now. Perhaps when it was opened in the browser, it tried to save the images again. Anyway, you're right about one thing: if you want something saved, it's probably a good idea to make a copy for yourself "just in case".

Reply

Poster: thenewdesignisshitty Date: Mar 2, 2015 6:32am
Forum: forums Subject: Re: Why are simple images missing from very recent archived pages?

> Perhaps when it was opened in the browser, it tried to save the images again.

but but - I mean - so - it - then that's not an archive. It's just *The Internet*, and there's already an Internet.

They need to let people know this is a Kinda Maybe Cross-Your-Fingers-and-Pray Archive, not a normal archive as we're led to believe.