Comment 4 for bug 833045

Revision history for this message
George Duimovich (george-duimovich) wrote :

I guess at some point a line has to be drawn as to how much "bad data" can be anticipated / accommodated for versus library shops just fixing their data.

Here's another example from just poking around a bit more. I found over 5400 MARC 020's with data in this format (i.e. trailing colon), that might (?) present a problem with remove spaces:

<subfield code="a">0852981937 :</subfield>

This ISBN is perfectly findable in EG right now, but definitely a good / easy target for cleanup I think.... But wait, and Grrr - looking at sample records, it's clear that many/most cases the embedded ":" are there for display purposes!

020 . ‡a1551050420 : ‡c24.95

But the big data boss in the sky doesn't like that, so he commands that we change the standard, even if the standard won't change itself. That colon, IMHO, should be moved to display only in our shop IMHO, so if |c present, add the colon, etc.

Also, only a small number of our ISBN's have spaces instead of dashs FWIW.

thx