[Pasig-discuss] Experiences with S3-like object store service providers?
Neil Jefferies
neil.jefferies at bodleian.ox.ac.uk
Fri Nov 17 08:46:11 EST 2017
That’s nasty – I see your viewpoint, though. At the higher level of abstraction that you deal with “storage” that makes sense – since there seem to be unchecked and unauditable processes happening under the hood that may be the only good tool you have.
From: Pasig-discuss [mailto:pasig-discuss-bounces at asist.org] On Behalf Of Kyle Rimkus
Sent: 16 November 2017 01:55
To: Neil Jefferies <neil at jefferies.org>
Cc: pasig-discuss at mail.asis.org
Subject: Re: [Pasig-discuss] Experiences with S3-like object store service providers?
I see your point. We had an instance of a number of files failing their regular fixity check, after having passed several times over the years. When we investigated, the files were the same size on disk as they always were, but were just strings of zeroes, and unreadable. Whatever caused this could have been due to a transmission error behind the scenes during some sort of server maintenance, but from where we stood as a repository that had outsourced our storage we had no way of knowing one way or the other, and the fixity checks saved us.
On Wed, Nov 15, 2017 at 6:15 PM Neil Jefferies <neil at jefferies.org<mailto:neil at jefferies.org>> wrote:
Kyle,
Can you expand... because that doesn't actually sound like a "fixity"
issue (which is kind of my point). Either the wrong thing has been
copied or the wrong thing has been stored i.e. an off-storage process
failure. For those cases checks should really be done at the time of
whatever operation caused the issue. Waiting for a check to pick it up
downstream at some later time is somewhat risky.
My point is, these are not "fixity" checks, they are transmission
checks.
Neil
On 2017-11-15 22:59, Kyle Rimkus wrote:
> On the issue of fixity checking, I agree there is a great deal of
> misplaced paranoia around "bit-flipping" in contemporary storage
> technologies, but at the same time I've found regular fixity checks to
> be very useful in protecting against various types of human error that
> can be made in managing the storage itself or in scripted processes
> that interact with stored data.
>
> As an example, at my university we farm our storage out to a campus
> unit which stores two copies locally while we push a third into Amazon
> Glacier. Our preservation repository software's regular fixity checks
> of on-campus data have helped us keep our storage providers and
> ourselves honest. We have on more than one occasion discovered fixity
> errors that pointed to questionable server management. Regular fixity
> checking was what flagged us to these errors, and the storage of a
> third file copy off-site was what saved us.
>
> We are also looking into pushing more of our storage services into the
> (most likely AWS) cloud, where greater guarantees are made against
> this type of problem. Maybe in time we'll come to see regular fixity
> checks as less critically important than we do now. Julian's comment
> that "certifying a particular storage environment to either store bits
> correctly or alert on failure should be acceptable" is interesting.
> I'm sure we'd all like to get away from having to run constant fixity
> checks in our repositories, and would like to see digital preservation
> management architecture evolve in this direction. For now though I'd
> wager that most of us are constrained by the fact that fixity checking
> remains essential to making sure our storage is doing what it claims
> to do.
>
> Kyle
>
> --
>
> Kyle R. Rimkus
>
> Assistant Professor
>
> Preservation Librarian
>
> University of Illinois at Urbana-Champaign
> ----
> To subscribe, unsubscribe, or modify your subscription, please visit
> http://mail.asis.org/mailman/listinfo/pasig-discuss
> _______
> PASIG Webinars and conference material is at
> http://www.preservationandarchivingsig.org/index.html
> _______________________________________________
> Pasig-discuss mailing list
> Pasig-discuss at mail.asis.org<mailto:Pasig-discuss at mail.asis.org>
> http://mail.asis.org/mailman/listinfo/pasig-discuss
--
Kyle R. Rimkus
Assistant Professor
Preservation Librarian
University of Illinois at Urbana-Champaign
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.asis.org/pipermail/pasig-discuss/attachments/20171117/f8f25def/attachment.html>
More information about the Pasig-discuss
mailing list