QA catalogue for analysing library data

en | de | pt | hu
number of records: 2,958,742     last data update: 2026-07-01 22:15     timestamp of analysis: 2026-07-01 21:46:33 (00:11:16)

Validation of latest changes

Conformity with the bibliographic metadata scheme. Accuracy is the degree to which the catalogue records conform to the syntax, the semantics, the rules and guidelines of the metadata schema (including the locally defined data elements). For those metadata fields which are not defined in the metadata schema's available documentation the tool is not able to decide if the data are correct or not. The page gives an overview and details of the captured issues. The user can check each individual records haveing a particular problem, and download their identifiers.

number of new/updated reords: 19479

records without issues
with
 
2,983 (15.31%)
16,496 (84.69%)

excluding undefined field issues

records without issues
with
 
2,983 (15.31%)
16,496 (84.69%)
instances records %
control field level issues 13,760 3,122
 
16.03
obsolete code (2 variants) [+] 123 123
 
0.63
data element message
008/34 (008visual34) " " 7 7
 
0.04
008/33 (008book33) " " 116 116
 
0.60
count: 2 | list all | grouped by tag
invalid code (9 variants) [+] 3,911 1,968
 
10.10
data element message
008/18-21 (008book18) '_' in '|||_' 3 3
 
0.02
008/24-27 (008book24) '_' in '____' 12 3
 
0.02
008/30-31 (008music30) '_' in '|_' 3 3
 
0.02
008/18-21 (008book18) '_' in '||_|' 6 6
 
0.03
008/00-05 (008all00) Invalid content: ' '. Text ' ' could not be parsed at index 0 10 10
 
0.05
008/00-05 (008all00) Invalid content: '||||||'. Text '||||||' could not be parsed at index 0 15 15
 
0.08
008/18-21 (008book18) 'u' in 'und|' 40 40
 
0.21
008/18-21 (008book18) 'n' in 'und|' 40 40
 
0.21
008/33-34 (008map33) '|' in '||' 3,782 1,891
 
9.71
count: 9 | list all | grouped by tag
invalid value (29 variants) [+] 9,726 3,091
 
15.87
data element message
008/23 (008continuing23) 1 1
 
0.01
008/20 (008music20) " " 1 1
 
0.01
008/19 (008continuing19) " " 2 2
 
0.01
Leader/17 (leader17) I 2 2
 
0.01
008/23 (008book23) _ 3 3
 
0.02
008/29 (008continuing29) " " 3 3
 
0.02
008/34 (008continuing34) " " 3 3
 
0.02
Leader/05 (leader05) " " 3 3
 
0.02
008/39 (008all39) \ 3 3
 
0.02
008/18-19 (008music18) " " 3 3
 
0.02
008/33 (008music33) _ 3 3
 
0.02
008/18-20 (008visual18) " " 7 7
 
0.04
008/30 (008book30) _ 9 9
 
0.05
008/31 (008book31) _ 9 9
 
0.05
008/33 (008visual33) " " 11 11
 
0.06
008/29 (008book29) u 11 11
 
0.06
008/30 (008book30) n 11 11
 
0.06
008/31 (008book31) d 11 11
 
0.06
008/23 (008book23) 73 73
 
0.37
008/29 (008book29) " " 117 117
 
0.60
008/30 (008book30) " " 117 117
 
0.60
008/31 (008book31) " " 117 117
 
0.60
008/06 (008all06) " " 147 147
 
0.75
008/18-20 (008visual18) und 207 207
 
1.06
Leader/08 (leader08) 0 788 788
 
4.05
Leader/08 (leader08)   2,016 2,016
 
10.35
Leader/09 (leader09)   2,016 2,016
 
10.35
Leader/17 (leader17)   2,016 2,016
 
10.35
Leader/18 (leader18)   2,016 2,016
 
10.35
count: 29 | list all | grouped by tag
data field level issues 9 5
 
0.03
repetition of non-repeatable field (4 variants) [+] 6 3
 
0.02
data element message
911 there are multiple instances 1 1
 
0.01
245 there are multiple instances 1 1
 
0.01
044 there are multiple instances 1 1
 
0.01
040 there are multiple instances 3 3
 
0.02
count: 4 | list all | grouped by tag
undefined field (3 variants) [+] 3 2
 
0.01
data element message
925 925 1 1
 
0.01
867 867 1 1
 
0.01
868 868 1 1
 
0.01
count: 3 | list all | grouped by tag
indicator level issues 15,722 13,447
 
69.03
non-empty indicator (2 variants) [+] 12,220 12,220
 
62.73
data element message
911$ind1 0 317 317
 
1.63
911$ind1 1 11,903 11,903
 
61.11
count: 2 | list all | grouped by tag
invalid value (43 variants) [+] 3,502 1,466
 
7.53
data element message
772$ind1 " " 1 1
 
0.01
787$ind1 " " 1 1
 
0.01
247$ind1 " " 1 1
 
0.01
247$ind2 " " 1 1
 
0.01
610$ind2 " " 1 1
 
0.01
210$ind1 " " 1 1
 
0.01
785$ind1 " " 1 1
 
0.01
785$ind2 " " 1 1
 
0.01
780$ind1 " " 1 1
 
0.01
780$ind2 " " 1 1
 
0.01
880->246$ind1 " " 2 2
 
0.01
028$ind1 " " 2 2
 
0.01
028$ind2 " " 2 2
 
0.01
770$ind1 " " 3 2
 
0.01
880->245$ind1 " " 2 2
 
0.01
880->245$ind2 " " 2 2
 
0.01
777$ind1 " " 3 3
 
0.02
610$ind1 " " 4 4
 
0.02
100$ind1 " " 4 4
 
0.02
767$ind1 " " 14 8
 
0.04
362$ind1 " " 9 9
 
0.05
510$ind1 " " 10 10
 
0.05
866$ind2 " " 20 15
 
0.08
600$ind2 " " 19 16
 
0.08
765$ind1 " " 16 16
 
0.08
600$ind1 " " 22 19
 
0.10
700$ind1 " " 35 23
 
0.12
776$ind1 " " 24 24
 
0.12
246$ind1 " " 32 26
 
0.13
775$ind1 " " 95 31
 
0.16
773$ind1 " " 45 45
 
0.23
110$ind1 " " 45 45
 
0.23
490$ind1 " " 51 50
 
0.26
786$ind1 " " 107 82
 
0.42
710$ind1 " " 123 122
 
0.63
655$ind2 " " 170 167
 
0.86
648$ind2 " " 249 230
 
1.18
651$ind2 " " 241 232
 
1.19
264$ind2 " " 285 285
 
1.46
774$ind1 " " 288 288
 
1.48
650$ind2 " " 545 412
 
2.12
245$ind2 " " 511 511
 
2.62
245$ind1 " " 512 512
 
2.63
count: 43 | list all | grouped by tag
subfield level issues 26,639 13,197
 
67.75
undefined subfield (3 variants) [+] 25,574 12,784
 
65.63
data element message
856 k 1 1
 
0.01
911 # 12,686 12,683
 
65.11
911 a 12,887 12,783
 
65.62
count: 3 | list all | grouped by tag
invalid classification reference (3 variants) [+] 27 15
 
0.08
data element message
600$ind2 ind2 is '7' which means that the value should be found in subfield $2, but it is missing 1 1
 
0.01
655$ind2 ind2 is '7' which means that the value should be found in subfield $2, but it is missing 9 7
 
0.04
650$ind2 ind2 is '7' which means that the value should be found in subfield $2, but it is missing 17 13
 
0.07
count: 3 | list all | grouped by tag
repetition of non-repeatable subfield (9 variants) [+] 1,024 413
 
2.12
data element message
500$a there are multiple instances 1 1
 
0.01
035$a there are multiple instances 1 1
 
0.01
040$a there are multiple instances 1 1
 
0.01
245$a there are multiple instances 1 1
 
0.01
245$b there are multiple instances 2 2
 
0.01
245$c there are multiple instances 2 2
 
0.01
654$* there are multiple instances 40 24
 
0.12
650$a there are multiple instances 488 386
 
1.98
650$* there are multiple instances 488 386
 
1.98
count: 9 | list all | grouped by tag
invalid ISBN (5 variants) [+] 13 13
 
0.07
data element message
780$z ISBN does not fit the pattern \d[\d-]+[\dxX]. 1 1
 
0.01
775$z ISBN length should be either 10 or 13. 1 1
 
0.01
020$a ISBN does not fit the pattern \d[\d-]+[\dxX]. 2 2
 
0.01
020$a ISBN length should be either 10 or 13. 4 4
 
0.02
020$a ISBN is not a valid ISBN 10 value 5 5
 
0.03
count: 5 | list all | grouped by tag
invalid ISSN (1 variants) [+] 1 1
 
0.01
022$a ISSN does not fit the pattern \d{4}-\d{3}[\dX]. 1 1
 
0.01
count: 1 | list all | grouped by tag
analysis parameters
files kbr-0.xml.gz
kbr-1.xml.gz
kbr-2.xml.gz
marcVersion KBR
marcFormat XML
dataSource FILE
limit -1
offset -1
id
defaultRecordType BOOKS
alephseq false
marcxml true
lineSeparated false
trimId true
recordFilter {conditions: —, empty: true } json: {"conditions":null,"empty":true}
ignorableFields {fields: [590, 591, 592, 593, 594, 595, 596, 659, 900, 912, 916, 940, 941, 942, 944, 945, 946, 948, 949, 950, 951, 952, 953, 954, 970, 971, 972, 973, 975, 977, 988, 989 ], empty: false }
stream
defaultEncoding
alephseqLineType
picaIdField 003@$0
picaSubfieldSeparator $
picaSchemaFile
picaRecordTypeField 002@$0
schemaType MARC21
groupBy
groupListFile
solrForScoresUrl http://localhost:8983/solr/kbr_scores
processRecordsWithoutId false
detailsFileName issue-details.csv
summaryFileName issue-summary.csv
format COMMA_SEPARATED
ignorableIssueTypes
pica false
unimarc false
replacementInControlFields #
marc21 true
mqaf.version 0.9.8
qa-catalogue.version 0.8.0-SNAPSHOT
duration 00:11:16
numberOfprocessedRecords 2958742
analysisTimestamp 2026-07-01 21:46:33