How to make your content users more productive using Access Innovations, Inc.'s Navtree and Machine Aided Indexer (M.A.I.™), parts of the Data Harmony® software suite.
2. 2
Taxonomies aid site
organization
Taxonomy provides:
Framework for content
organization
Hierarchical outline of your
content by subject categories
Basis for effective browsing
3. 3
Integrated taxonomy enhances
findability
Browsable categories of a directory
Smart search for term equivalents
Taxonomy terms (original or modified)
as labels
Navigation aids incorporate taxonomy
terms and relationships
4. 4
Example Search: body growth
Complete database (60,000 + titles)
Free text search
8 hits — some irrelevant
Free text search on titles
6 hits — limited recall
Search by taxonomy descriptor (AKA
subject term or category)
470 hits
100% relevant
100% recall
5. 5
Increasing User
Productivity
Items in an information collection
can be retrieved with better
precision (relevance) and better
recall by using a controlled
vocabulary to assign subject terms
(key words) to them
How do you connect your
users to the controlled
vocabulary?
6. 6
Connecting Users
1. Use the rulebase you’ve
developed for machine
aided indexing (MAIQuery)
2. Use the controlled
vocabulary itself
(TM Navtree)
7. 7
MAI’s talents
MAI (Machine Aided Indexer)
helps authors and editors
assign effective subject terms
automates the assignment of
subject terms to items in legacy
collections
8. 8
M.A.I. suggests the correct terms
from the taxonomy as descriptors
M.A.I. rulebase recognizes term
equivalents
germs Microorganisms
vaccin* Pharmaceutical drugs
Recognizing term equivalents
enables enhanced search
Taxonomy terms on documents
help sort and organize the content
9. 9
MAI’s “hidden talents”
MAI can also:
Provide for the appropriate
preferred term when given a
word or phrase
Return preferred terms for uses
of the word in different contexts
10. 10
More “hidden talents”
MAIQuery can:
Show related terms from the
thesaurus to broaden a search
Show the rules and preferred
term’s scope notes to clarify
how the preferred term relates
to others in the thesaurus
11. 11
Presenting: MAIQuery™
Web page presents a search box
that will use the MAI rulebase
Can be in addition to full text
search and advanced search
User enters a word or phrase in
the search box
MAI searches the rulebase for
any occurrences of the word(s)
13. 13
the MAIQuery demo
Uses web pages and php coding:
Passes the search words to
“dosearch.php”
dosearch.php passes the term to
MAI’s concept extractor
MAI returns a list of suggested
terms from the controlled
vocabulary
14. 14
Suggested terms
The term Music is suggested
by the rule for music*(1)
Click on the first (the preferred term) to see the term record; click on the second to see the MAI rule
The term Instrumental Music is suggested
by the rule for music*(1)
Click on the first (the preferred term) to see the term record; click on the second to see the MAI rule
15. 15
Options
Thesaurus Master can be queried
to show the term record
Broader term
Narrower terms
Use For terms (“synonyms”)
Related terms
Scope notes
19. 19
Options, continued
Your database/index of items is
then queried to bring back the
records in your collection that are
indexed with the preferred term
For our demo, we wrote an xquery
request into the gettitles.php file
Our 1100-title demo records are
maintained by a MarkLogic server
23. 23
How’s it working?
What words and phrases do your
users search for?
a search log can record “misses”
a user focus group can suggest
additions
subject matter experts can help in
their area of expertise
24. 24
Fine tuning
Modify your taxonomy to respond
to more words
add common misspellings to
rules
add alternate words as Use For
terms (synonyms) in the
thesaurus
(or as additions to the rules)
consider terms for addition to
the thesaurus (candidates)
25. 25
The advantages
MAIQuery connects your user
with the controlled vocabulary
Your user can review term
records and rulebase rules to
learn more about your taxonomy
Your user becomes more
productive
26. 26
Another way to connect users
Category search used more than
half the time for research
Also known as directory search,
your user “drills down” from
general to specific
27. 27
Value of Category search
Searchers find info 50% faster
using browsable categories than
using list returned from free text
search
Results even stronger when results
not in top 20 returns
Searchers prefer browsable
category search
Chen, H., and Dumais, S.
33. 33
Thesauri OnLine
Australian Governments' Interactive
Functions Thesaurus – AGIFT
http://www.naa.gov.au/recordkeeping/thesauru
Transportation Research Thesaurus – TRT
http://ntl.bts.gov/trt/trt_topterms.jsp
NBII (National Biological Information
Infrastructure)
http
://thesaurus.nbii.gov/SearchNBIIThesaurus/ab
34. 34
Presenting: TM Navtree
Your thesaurus presented as a
navigation aid
User “drill down” with all the
neighboring terms visible
Each term indicates the number
of documents indexed with it
Terms are hyperlinks to a list of
items
38. 38
Choose a title
Click on a title, get its details (or
bring up the item)
39. 39
How it’s done
We used PHP Levels, an open
source application from
SourceForge to create the tree
An exported XML version of the
thesaurus is parsed to produce
the required text file to populate
the tree
The content manager is queried
for the document totals
40. 40
How it’s done, continued
When a term is selected, it is
passed to a gettitles.php
A bit of php code connects to the
content manager and returns a
string of data about each title
The web page displays the data
in the format desired
41. 41
The advantages
TM Navtree Top Terms describe
the organization of your
collection(s)
Narrower terms help your user
hone in on the most appropriate
term
Adjacent terms impart
connotation
42. 42
The advantages
ALL the records indexed with the
chosen term are returned
Your user finds what’s needed
more quickly and is more
productive
43. 43
Questions?
Comments?
Try out the demo at
www.mediasleuth.com
See more details:
Data Harmony Programmer
Interface for Web Applications
Thank you.
Mary Garcia
44. 44
MAI Query and NavTree from
Data Harmony
Making Users
More Productive
Notes de l'éditeur
There are other forms of organization – alpha, chronological, geographical, audience, etc.
Taxonomy organizes by topic, by subject, by aboutness.
We already know that it helps your authors and editors assign effective subject terms
And that it automates indexing items in legacy collections
Recognizing term equivalents – important point, we’ll see more on this later.
We already know that it helps your authors and editors assign effective subject terms
And that it automates indexing items in legacy collections
We already know that it helps your authors and editors assign effective subject terms
And that it automates indexing items in legacy collections
Your user interface can offer a rule-base assisted search or a full text search
Any language – jsp, asp, Perl – can be used
When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user
Level 2
Level 3 - success for this category
Your user interface can offer a rule-base assisted search or a full text search
Our demo page includes an MAIQuery search box also
We see 4 levels here – Business, Business Enterprises, Corporations, Corporate structure
Each indicates how many titles are indexed with it and how many are indexed with either it or its child, narrower terms
See options at bottom
When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user
When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user