The manner in which information is shared has changed rapidly
with the advent of the Web. New forms for sharing lexical data have
emerged, too, owing to the transition from ink to Internet. A notable
one is Linguistic Linked Data (LLD), which offers several benefits
over other formats by taking advantage of decentralized Internet
technology and open Web standards. The benefits in using this form
– and in applying related Web technologies – are demonstrated in the
new Web application Evoke. Evoke allows exploring and interacting
with linguistic resources, employing a novel method that enables
users to view a thesaurus, extend it – without the risk of infringing on
licenses or requiring additional hosting costs with publishers – and
perform new and exciting analyses over a combination of datasets.
Moreover, the application showcases how a number of mechanisms,
built on top of LLD, can be employed to reduce barriers for users to
start working with valuable linguistic resources. The development of
Evoke has been supported by experts in lexicography, linguistics and
philology (amongst other fields), to ensure the software is intuitive
and useful for both research and educational purposes. The following
sections will discuss (1) the Web application Evoke; (2) A Thesaurus
of Old English, the first LLD resource made available in Evoke; and
(3) current uses in research and education of these two resources.
(1) Evoke
The Web application Evoke allows users to interact with thesauri.
These lexicographic works organize words according to meaning –
by means of a semantic hierarchy – rather than alphabetically, thus
enabling the user to move from meaning to words that express that
meaning. Users of Evoke can not only browse and view content
captured within these linguistic resources (cf. Figures 1 and 2), but
also expand on them and, owing to the organization principle of
such works, perform analyses through the semantic lens of their
overarching hierarchies. The resulting onomasiological profiles that
are generated by Evoke are based on features of interest that the user
selects: specific labels, languages, and/or parts of speech. A generated
profile contains statistics and charts on the item count of a selection,
its degree of ambiguity (indicating polysemy), its degree of synonymy,
Sander Stolk is a PhD
researcher in Digital
Humanities at Leiden
University. He holds an
MSc in Computer Science
from VU Amsterdam
and an MA in English
Language and Culture
from Leiden University.
His research investigates
how the dissemination
and reuse of thesauri can
be improved by utilizing
Linked Data technology,
having the analysis of
information needs of
scholars and development
of the Web application
Evoke at its core, and he
is a member of the W3C
OntoLex community
that works on standards
regarding Linguistic
Linked Data. He is also
Head of Innovation at
Semmtech, a company
specializing in using
Linked Data technology.