QA catalogue for analysing library data

en | de | pt | hu
number of records: 2,953,005     last data update: 2026-06-25 22:15     timestamp of analysis: 2026-06-25 12:22:41 (00:03:41)

Serials analysis

These scores are calculated for each continuing resources (type of record (LDR/6) is language material ('a') and bibliographic level (LDR/7) is serial component part ('b'), integrating resource ('i') or serial ('s')).

The calculation is based on a slightly modified version of the method published by Jamie Carlstone in the following paper:

Jamie Carlstone (2017) Scoring the Quality of E-Serials MARC Records Using Java, Serials Review, 43:3-4, pp. 271-277, DOI: 10.1080/00987913.2017.1350525 URL: https://www.tandfonline.com/doi/full/10.1080/00987913.2017.1350525

Histogram

  • y: number of records
  • x: number of authority names in one record
Each records having ... get a score based on a number of criteria. Each criteria results in a positive or negative score. The final score is these criteria scores.
criteria score
1. date1 (008/07-11) is unknown ('uuuu') -3
2. place of publication (008/15) is unknown (~ 'xx.+') -1
3. publication language (008/35) is unknown (xxx) -1
4. has authentication code (042$a) 7
5. encoding level (LDR/17) is Full level (‘ ‘) or Full level, material not examined (1) or Full level input by OCLC participants (I) 7
6. encoding level (LDR/17) is Added from a batch process (M), L, or Minimal level input by OCLC participants (K), or Minimal level (7) 1
7. has 006 field 1
8. has publisher (260) 1
9. has production, publication, distribution (264) 1
10. has publication frequency (310) 1
11. has content type (336) 1
12. has dates of publication (362) 1
13. has source of description (588) 1
14. has no subject headings -5
15. for each subject headings 1
16. authentication code (042$a) = “ppc” 100
17. date1 begins with '0' -100
18. Encoding level is abbreviated -100

components

The histograms of the individual components:

1. Date 1 is unknown

2. Country of publication is unknown

3. Publication language is unknown

4. Authentication code is empty

5. Encoding level is full

6. Encoding level is M, L, K or 7

7. 006 is present

8. Publisher 260 (AACR2) is present

9. Publisher 264 (RDA) is present

10. Publication frequency is present

11. Content Type (336) is present

12. Dates of Publication (362) is present

13. Source of Description Note (588) is present

14. No subject is present

15. Subject is present

16. Authentication Code is pcc

17. First date (008/07) startes with 0

18. Title is inactive - no date2

19. Encoding level is abbreviated

analysis parameters
files kbr-0.xml.gz
kbr-1.xml.gz
kbr-2.xml.gz
marcVersion KBR
marcFormat XML
dataSource FILE
limit -1
offset -1
id
defaultRecordType BOOKS
alephseq false
marcxml true
lineSeparated false
trimId true
recordFilter {conditions: —, empty: true } json: {"conditions":null,"empty":true}
ignorableFields {fields: [590, 591, 592, 593, 594, 595, 596, 659, 900, 912, 916, 940, 941, 942, 944, 945, 946, 948, 949, 950, 951, 952, 953, 954, 970, 971, 972, 973, 975, 977, 988, 989 ], empty: false }
stream
defaultEncoding
alephseqLineType
picaIdField 003@$0
picaSubfieldSeparator $
picaSchemaFile
picaRecordTypeField 002@$0
schemaType MARC21
groupBy
groupListFile
solrForScoresUrl http://localhost:8983/solr/kbr_scores
processRecordsWithoutId false
fileName serial-score.csv
replacementInControlFields #
marc21 true
unimarc false
pica false
mqaf.version 0.9.8
qa-catalogue.version 0.8.0-SNAPSHOT
numberOfprocessedRecords 2953005
duration 00:03:41
analysisTimestamp 2026-06-25 12:22:41