QA catalogue for analysing library data

en | de | pt | hu
number of records: 2,955,007     last data update: 2026-06-29 22:15     timestamp of analysis: 2026-06-30 23:33:04 (00:10:22)

Validation of latest changes

Conformity with the bibliographic metadata scheme. Accuracy is the degree to which the catalogue records conform to the syntax, the semantics, the rules and guidelines of the metadata schema (including the locally defined data elements). For those metadata fields which are not defined in the metadata schema's available documentation the tool is not able to decide if the data are correct or not. The page gives an overview and details of the captured issues. The user can check each individual records haveing a particular problem, and download their identifiers.

number of new/updated reords: 9000

records without issues
with
 
1,224 (13.60%)
7,776 (86.40%)

excluding undefined field issues

records without issues
with
 
1,224 (13.60%)
7,776 (86.40%)
instances records %
control field level issues 461 219
 
2.43
obsolete code (2 variants) [+] 21 21
 
0.23
data element message
008/34 (008visual34) " " 1 1
 
0.01
008/33 (008book33) " " 20 20
 
0.22
count: 2 | list all | grouped by tag
invalid code (5 variants) [+] 28 18
 
0.20
data element message
008/00-05 (008all00) Invalid content: '||||||'. Text '||||||' could not be parsed at index 0 2 2
 
0.02
008/18-21 (008book18) '_' in '||_|' 3 3
 
0.03
008/00-05 (008all00) Invalid content: ' '. Text ' ' could not be parsed at index 0 3 3
 
0.03
008/18-21 (008book18) 'u' in 'und|' 10 10
 
0.11
008/18-21 (008book18) 'n' in 'und|' 10 10
 
0.11
count: 5 | list all | grouped by tag
invalid value (23 variants) [+] 412 218
 
2.42
data element message
008/19 (008continuing19) " " 1 1
 
0.01
008/29 (008continuing29) " " 1 1
 
0.01
008/34 (008continuing34) " " 1 1
 
0.01
Leader/05 (leader05) " " 1 1
 
0.01
008/18-20 (008visual18) " " 1 1
 
0.01
008/29 (008book29) u 1 1
 
0.01
008/30 (008book30) n 1 1
 
0.01
008/31 (008book31) d 1 1
 
0.01
Leader/17 (leader17) I 2 2
 
0.02
008/30 (008book30) _ 3 3
 
0.03
008/31 (008book31) _ 3 3
 
0.03
008/33 (008visual33) " " 3 3
 
0.03
008/23 (008book23) 14 14
 
0.16
008/29 (008book29) " " 20 20
 
0.22
008/30 (008book30) " " 20 20
 
0.22
008/31 (008book31) " " 20 20
 
0.22
008/06 (008all06) " " 25 25
 
0.28
Leader/08 (leader08)   32 32
 
0.36
Leader/09 (leader09)   32 32
 
0.36
Leader/17 (leader17)   32 32
 
0.36
Leader/18 (leader18)   32 32
 
0.36
008/18-20 (008visual18) und 43 43
 
0.48
Leader/08 (leader08) 0 123 123
 
1.37
count: 23 | list all | grouped by tag
data field level issues 1 1
 
0.01
repetition of non-repeatable field (1 variants) [+] 1 1
 
0.01
040 there are multiple instances 1 1
 
0.01
count: 1 | list all | grouped by tag
indicator level issues 7,971 7,540
 
83.78
non-empty indicator (2 variants) [+] 7,297 7,297
 
81.08
data element message
911$ind1 0 97 97
 
1.08
911$ind1 1 7,200 7,200
 
80.00
count: 2 | list all | grouped by tag
invalid value (29 variants) [+] 674 344
 
3.82
data element message
610$ind1 " " 1 1
 
0.01
028$ind1 " " 1 1
 
0.01
028$ind2 " " 1 1
 
0.01
765$ind1 " " 1 1
 
0.01
600$ind2 " " 1 1
 
0.01
866$ind2 " " 2 2
 
0.02
880->246$ind1 " " 2 2
 
0.02
510$ind1 " " 2 2
 
0.02
600$ind1 " " 2 2
 
0.02
362$ind1 " " 2 2
 
0.02
767$ind1 " " 3 3
 
0.03
100$ind1 " " 3 3
 
0.03
776$ind1 " " 4 4
 
0.04
246$ind1 " " 5 4
 
0.04
775$ind1 " " 9 5
 
0.06
773$ind1 " " 5 5
 
0.06
700$ind1 " " 8 6
 
0.07
490$ind1 " " 8 8
 
0.09
786$ind1 " " 12 9
 
0.10
110$ind1 " " 16 16
 
0.18
655$ind2 " " 26 26
 
0.29
264$ind2 " " 32 32
 
0.36
648$ind2 " " 46 42
 
0.47
651$ind2 " " 46 44
 
0.49
774$ind1 " " 61 61
 
0.68
710$ind1 " " 80 80
 
0.89
245$ind1 " " 84 84
 
0.93
245$ind2 " " 84 84
 
0.93
650$ind2 " " 127 94
 
1.04
count: 29 | list all | grouped by tag
subfield level issues 15,044 7,504
 
83.38
undefined subfield (2 variants) [+] 14,874 7,437
 
82.63
data element message
911 # 7,422 7,422
 
82.47
911 a 7,452 7,437
 
82.63
count: 2 | list all | grouped by tag
invalid classification reference (1 variants) [+] 2 2
 
0.02
650$ind2 ind2 is '7' which means that the value should be found in subfield $2, but it is missing 2 2
 
0.02
count: 1 | list all | grouped by tag
repetition of non-repeatable subfield (3 variants) [+] 165 67
 
0.74
data element message
654$* there are multiple instances 5 5
 
0.06
650$a there are multiple instances 80 62
 
0.69
650$* there are multiple instances 80 62
 
0.69
count: 3 | list all | grouped by tag
invalid ISBN (2 variants) [+] 3 3
 
0.03
data element message
780$z ISBN does not fit the pattern \d[\d-]+[\dxX]. 1 1
 
0.01
020$a ISBN is not a valid ISBN 10 value 2 2
 
0.02
count: 2 | list all | grouped by tag
analysis parameters
files kbr-0.xml.gz
kbr-1.xml.gz
kbr-2.xml.gz
marcVersion KBR
marcFormat XML
dataSource FILE
limit -1
offset -1
id
defaultRecordType BOOKS
alephseq false
marcxml true
lineSeparated false
trimId true
recordFilter {conditions: —, empty: true } json: {"conditions":null,"empty":true}
ignorableFields {fields: [590, 591, 592, 593, 594, 595, 596, 659, 900, 912, 916, 940, 941, 942, 944, 945, 946, 948, 949, 950, 951, 952, 953, 954, 970, 971, 972, 973, 975, 977, 988, 989 ], empty: false }
stream
defaultEncoding
alephseqLineType
picaIdField 003@$0
picaSubfieldSeparator $
picaSchemaFile
picaRecordTypeField 002@$0
schemaType MARC21
groupBy
groupListFile
solrForScoresUrl http://localhost:8983/solr/kbr_scores
processRecordsWithoutId false
detailsFileName issue-details.csv
summaryFileName issue-summary.csv
format COMMA_SEPARATED
ignorableIssueTypes
pica false
unimarc false
replacementInControlFields #
marc21 true
mqaf.version 0.9.8
qa-catalogue.version 0.8.0-SNAPSHOT
numberOfprocessedRecords 2955007
duration 00:10:22
analysisTimestamp 2026-06-30 23:33:04