EAD Tag Usage - Society of American Archivists

EAD Tag Usage
KATHERINE M. WISSER & JACKIE DEAN
AUGUST 2011
EAD ROUNDTABLE AND EAD FORUM
SOCIETY OF AMERICAN ARCHIVISTS
Some initial numbers
 1,136 finding aids
 108 repositories
File Sizes
30KB
5,292 KB
12,0333KB
Mean: 273.22 KB, Median: 50 KB, Mode: 11 KB
<eadheader>
Element        Number in sample   Number in unique finding aids   % in sample
eadid          1,136              1,136                           100.0%
filedesc       1,136              1,136                           100.0%
profiledesc    1,114              1,114                           98.1%
revisiondesc   372                372                             32.7%
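The report does not say how its counts were produced. As a minimal sketch, assuming each finding aid is available as an XML string, the three columns used throughout these tables (occurrences in the sample, number of finding aids containing the element, and percentage of the sample) could be tallied as follows. The function names `tally_children` and `percent_in_sample` are hypothetical, and namespace handling is simplified to comparing local tag names.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Drop any '{namespace}' prefix so DTD- and schema-based EAD compare alike."""
    return tag.rsplit("}", 1)[-1]


def tally_children(documents, parent):
    """Count the direct children of every <parent> element across documents.

    Returns (total, unique): total occurrences of each child element, and the
    number of documents in which each child appears at least once.
    """
    total = Counter()
    unique = Counter()
    for xml_text in documents:
        root = ET.fromstring(xml_text)
        seen = set()
        for elem in root.iter():
            if local_name(elem.tag) != parent:
                continue
            for child in elem:
                name = local_name(child.tag)
                total[name] += 1
                seen.add(name)
        unique.update(seen)  # each document contributes at most 1 per element
    return total, unique


def percent_in_sample(unique_count, sample_size):
    """Share of finding aids containing the element, rounded to one decimal."""
    return round(100.0 * unique_count / sample_size, 1)


# Reproduces the <revisiondesc> row above: 372 of 1,136 finding aids.
print(percent_in_sample(372, 1136))
```

Keeping the two counters separate is what lets a repeatable element show a large "number in sample" while appearing in only a few finding aids.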
<filedesc>
Element           Number in sample   Number in unique finding aids   % in sample
editionstmt       45                 45                              4.0%
notestmt          103                103                             9.1%
publicationstmt   1,081              1,081                           95.2%
seriesstmt        1                  1                               0.1%
titlestmt         1,136              1,136                           100.0%
<profiledesc>
Element     Number in sample   Number in unique finding aids   % in sample
creation    1,076              1,076                           94.7%
descrules   486                486                             42.8%
langusage   1,047              1,047                           92.2%
<revisiondesc>
Element   Number in sample   Number in unique finding aids   % in sample
change    552                345                             30.4%
list      7,575              27                              2.4%
<frontmatter>
(n=279)
Element     Number in sample   % in sample
titlepage   259                92.8%
div         6                  2.2%
empty       14                 5.0%
Values for @level within <archdesc>
@level value   Number in sample   % in sample
collection     1,033              90.9%
fonds          55                 4.8%
class          3                  0.3%
recordgrp      16                 1.4%
series         7                  0.6%
subfonds       3                  0.3%
subgrp         11                 1.0%
subseries      0                  0.0%
file           4                  0.4%
item           1                  0.1%
otherlevel     1                  0.1%
Total          1,136
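A tally like the one above can be sketched by reading the `level` attribute of the first `<archdesc>` in each finding aid. This is an illustration under assumptions rather than the authors' method: the function name `archdesc_levels` and the placeholder key for a missing attribute are hypothetical, and the finding aids are assumed to be available as XML strings.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def archdesc_levels(documents):
    """Tally the @level value on the first <archdesc> of each document."""
    counts = Counter()
    for xml_text in documents:
        root = ET.fromstring(xml_text)
        for elem in root.iter():
            # Compare local names so namespaced and non-namespaced EAD both match.
            if elem.tag.rsplit("}", 1)[-1] == "archdesc":
                counts[elem.get("level", "(no level attribute)")] += 1
                break  # only the top-level description is wanted
    return counts
```

Recording a missing attribute under its own key, rather than skipping the file, keeps the row counts comparable to the sample size.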
Elements within the <archdesc>/<did>
Element        Number in sample   Unique finding aids   % in sample
abstract       1,085              984                   86.6%
container      10                 4                     0.4%
langmaterial   1,042              1,021                 89.9%
materialspec   18                 18                    1.6%
origination    1,216              1,011                 89.0%
physdesc       1,176              1,104                 97.2%
physloc        557                316                   27.8%
repository     1,141              1,132                 99.6%
unitdate       1,651              1,102                 97.0%
unitid         1,151              1,024                 90.1%
unittitle      1,582              1,136                 100.0%
Other elements within the <archdesc>
Element          Number in sample   Unique finding aids   % in sample
accessrestrict   991                979                   86.2%
accruals         81                 81                    7.1%
acqinfo          796                772                   68.0%
altformavail     152                144                   12.7%
appraisal        57                 54                    4.8%
custodhist       163                160                   14.1%
originalsloc     39                 39                    3.4%
otherfindaid     146                135                   11.9%
phystech         48                 48                    4.2%
prefercite       970                970                   85.4%
processinfo      676                645                   56.8%
userestrict      808                776                   68.3%
Other elements within the <archdesc>
Element             Number in sample   Unique finding aids   % in sample
arrangement         761                744                   65.5%
bibliography        134                115                   10.1%
bioghist            1,118              992                   87.3%
controlaccess       3,543              966                   85.0%
fileplan            7                  7                     0.6%
index               48                 14                    1.2%
odd                 214                110                   9.7%
relatedmaterial     494                458                   40.3%
scopecontent        1,111              1,061                 93.4%
separatedmaterial   178                168                   14.8%
<dsc>
(n=1,136)
Element           Number in sample   % in sample
One <dsc>         1,026              90.3%
Multiple <dsc>s   27                 2.4%
No <dsc>s         83                 7.3%
@type for <dsc>
@type values        Number in sample   % in sample
Total <dsc>s        1,105              97.2% (n=1,136)
no type attribute   90                 8.1% (n=1,105)
analyticover        56                 5.1% (n=1,105)
combined            735                66.5% (n=1,105)
in-depth            185                16.7% (n=1,105)
othertype           39                 3.5% (n=1,105)
<c>-<c12> (n=1,053)
Element   Number in sample   Unique finding aids   % in sample
c         113,133            117                   11.1%
c01       31,792             927                   88.0%
c02       189,148            763                   72.5%
c03       239,029            440                   41.8%
c04       104,161            217                   20.6%
c05       31,306             113                   10.7%
c06       10,820             48                    4.6%
c07       3,127              21                    2.0%
c08       1,546              7                     0.7%
c09       485                3                     0.3%
c10       10                 1                     0.1%
c11 = 0
c12 = 0
[Chart: number of <c01> through <c12> elements in the sample; vertical axis from 0 to 300,000]
Values for @level within <dsc> (n=1,053)
@level value   Number in sample   Unique finding aids   % in sample
collection     509                22                    2.1%
fonds          12                 7                     0.7%
class          1,535              13                    1.2%
recordgrp      31                 7                     0.7%
series         8,390              818                   77.7%
subfonds       119                18                    1.7%
subgrp         339                33                    3.1%
subseries      14,962             372                   35.3%
file           357,262            599                   56.9%
item           130,178            255                   24.2%
otherlevel     25,877             96                    9.1%
Elements within the <c>-<c12>/<did> (n=1,053)
Element        Number in sample   Unique finding aids   % in sample
abstract       7,128              26                    2.5%
container      704,884            869                   82.5%
langmaterial   10,078             64                    6.1%
materialspec   8,395              14                    1.3%
origination    20,549             85                    8.1%
physdesc       124,763            573                   54.4%
physloc        11,354             61                    5.8%
repository     2,651              3                     0.3%
unitdate       470,673            954                   90.6%
unitid         233,952            486                   46.2%
unittitle      697,246            1,041                 98.9%
Other elements within the <c>-<c12> (n=1,053)
*Note: in the presentation at SAA the number for prefercite was erroneously reported as the archdesc-level value.
Element          Number in sample   Unique finding aids   % in sample
accessrestrict   6,727              113                   10.7%
accruals         0                  0                     0.0%
acqinfo          3,613              47                    4.5%
altformavail     5,518              28                    2.7%
appraisal        43                 7                     0.7%
custodhist       5,504              23                    2.2%
originalsloc     150                11                    1.0%
otherfindaid     463                24                    2.3%
phystech         570                16                    1.5%
prefercite *     2                  1                     0.1%
processinfo      740                40                    3.8%
userestrict      848                34                    3.2%
Other elements within the <c>-<c12> (n=1,053)
Element             Number in sample   Unique finding aids   % in sample
arrangement         2,128              200                   19.0%
bibliography        1,435              16                    1.5%
bioghist            1,099              48                    4.6%
controlaccess       58,366             54                    5.1%
fileplan            0                  0                     0.6%
index               2,943              7                     0.7%
odd                 16,525             76                    7.2%
relatedmaterial     1,701              46                    4.4%
scopecontent        110,648            645                   61.3%
separatedmaterial   0                  0                     0.0%
Content tags in <dsc> (n=1,053)
Element      Number in sample   Number in unique finding aids   % in sample
corpname     11,384             88                              8.4%
famname      251                18                              1.7%
function     0                  0                               0.0%
genreform    28,082             66                              6.3%
name         23,555             15                              1.4%
occupation   204                4                               0.4%
persname     66,197             136                             12.9%
subject      28,767             49                              4.7%
Digital Archival Objects
Element   Number in sample   Number in unique finding aids   % in sample
dao       24,997             87                              7.7%
daodesc   2,217              136                             12.0%
daogrp    5,503              106                             9.3%
daoloc    12,193             123                             10.8%
Date attributes
Attribute          Number in sample
calendar           193,658
certainty          5,101
datechar           4,229
era                193,669
type               111,125
type="inclusive"   110,744
type="bulk"        381
@relatedencoding
@relatedencoding standard value   Number in sample   % of all @relatedencoding
Number of @relatedencoding        1,662
MARC                              1,079              66.5%
Dublin Core                       521                32.1%
ISAD(G)v2                         19                 1.2%
MidosaXML                         3                  0.2%

Number of @relatedencoding   Number in sample   % with @relatedencoding (n=885)   % in sample (n=1,136)
one                          222                25.1%                             19.5%
two                          569                64.3%                             50.1%
three                        94                 10.6%                             8.3%
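The last table reports each count against two different bases: the 885 finding aids that carry @relatedencoding and the full sample of 1,136. The arithmetic can be checked with a short sketch; the names `share`, `rows`, and `table` are illustrative only, and the counts are taken directly from the table.

```python
def share(count, base):
    """Percentage of base, rounded to one decimal place."""
    return round(100.0 * count / base, 1)


# Finding aids grouped by how many @relatedencoding attributes they carry.
rows = {"one": 222, "two": 569, "three": 94}

with_attribute = sum(rows.values())  # finding aids with any @relatedencoding
sample_size = 1136                   # all finding aids in the sample

# For each row: (% of those with the attribute, % of the whole sample).
table = {
    label: (share(n, with_attribute), share(n, sample_size))
    for label, n in rows.items()
}

for label, (pct_with, pct_sample) in table.items():
    print(f"{label:<6}{pct_with:>6}%{pct_sample:>7}%")
```

Stating the base beside each percentage, as the slide does with its n values, avoids reading the two columns as if they shared a denominator.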
