Taxonomies and Semantic Enrichment

Report
Using Taxonomies to
Create People
Directories and Author
Networks
Bert Carelli, Vice President Business
Development
www.accessinn.com
Goals and Agenda




Why is it important right now that we
develop better resources about people?
What roles can taxonomies play in this
effort?
What opportunities are being created for
knowledge discovery and collaboration?
What broad initiatives and technologies
should we be aware of?
Access Innovations Background



Founded 1978: more taxonomies built than any other
company worldwide
Thought leaders and educators in taxonomy standards
Data Harmony Software: field tested, battle hardened,
award winning software
"7-fold productivity increase"
- ProQuest CSA
"Cut search time 50%."
The Weather Channel
The Semantic Roadmap:
Knowledge Organization Systems
•Complex
•High value
•Simple
•Low Value

Semantic network

Ontology

Thesaurus

Taxonomy

Controlled vocabulary

Synonym set/ring

Name authority file

Uncontrolled list
•Linked Entities
•Contextual Specificity
•Unrelated Entities
•Ambiguity
The Web Transformed…
Mobile


Social
Web 1.0 = Getting Content Online (over
120 million websites and growing)
Web 2.0 = Getting People Online
Facebook: 700 Million Users Projected for 2011 (Open-First)
Twitter: Over 9 Billion Updates per Month
Linkedin: Over 200 Million People
Searches per Month
A highly trusted source
Social technologies will boost the utility of this long-trusted source
Business Applications of Social Media






Sharing ideas for innovation
Co-developing new products or services
Enabling customers to get help from other
customers
Generating brand awareness and visibility
Peer review
Assisting the selling process by sharing
contacts in a community
Company or Member Profiles
User pastes or
uploads CV
User Reviews
tagging for accuracy
Button to autoextract taxonomy
attributes
One Person, Many Representations
VIAF: Virtual International
Authority File
http://viaf.org/viaf/95216565/
Creating an Author Authority Database
 Tag all articles in the repository with standard subjects
 Export author names, subjects, institutions, locations, etc.
 Disambiguate authors with the same or similar names
Author Data: View by Connections…
… or by location…
…Or in the Document itself:
http://dx.doi.org/10.1371/journal.pntd.0000228.x001
Many Repositories for Names
VIAF: Virtual International
Authority File
http://viaf.org
Project VIVO



Designed around linked data standards
Resource Description Framework (RDF)
VIVO’s ontology integrates data from
human resource systems, grants
databases, faculty annual reporting
systems, and publication databases
Free open-source software download:
http://vivo.sourceforge.net
Detailed Profiles of Medical/BioMedical Researchers
Explore a research area
Focus the results
Contact a researcher with Locate
the the PI for a grant
desired expertise and research
activity
Information stored as Resource Description Framework (RDF)
 Data is structured in the form of “triples” as subjectpredicate-object.
 Concepts and their relationships use a shared ontology to
facilitate the harvesting of data from multiple sources.
Dept. of
Genetic
s
Jane
Smith
College
of
Medicine
is member of
has affiliations with
Genetics
Institute
Journal
article
author of
Book
chapter
Subject
Predicate
Book
Object
Detailed Data Relationships
Connections among scientists illustrated
David Nelson
Biomedical Informatics
has research area
Inverse relationshipshas
are
created
position
in
organization with
position for
is research area of
has position in
featured in
Mike Conlon
Ed Tech Magazine
Clinical Translational
Science Institute (CTSI)
organization with
position for
Gene Anderson
features
has author
author of
Development of an Observational Instrument to
Measure Mother-Infant Separation Post Birth
Current and accurate data revealed
author of
has author
Visualizations: co-author networks
ORCID
125 Participant Organizations
25
ORCID Profile
Exchange
Testing Possible Matching Algorithms:
•
VIAF matching technology from OCLC
•
Author Resolver from ProQuest
•
Matching capability from OKKAM
Access Innovations Author Authority
ORCID
F67572010
26
Integrating Identity into Publisher
Systems
Manuscript Submission
• Confirm background and authenticity of submitting authors
Peer review
• Confirm background and authenticity of prospective reviewers and identify
potential conflicts of interest
Connection to publications and other works
• Linking ORCID profiles with scholarly output enables disambiguation and
improves publisher decision-making.
Social networks
• Integrating profile information into publishers social network systems
enhances value and stickiness of these services (e.g. Nature Networks)
Questions?
Bert Carelli
Vice President, Business Development
Access Innovations / Data Harmony
[email protected]
510-551-3784
Corporate Number: 505-998-0800

similar documents