Title: SteveCassidy

Abstract: This research paper presents and harmonizes two independent efforts to model annotated speech databases, one at Macquarie University and one at the University of Pennsylvania. It discusses query languages and applications, focusing on the annotation graph model. The research aims to develop platform-independent, open-source tools for creating, browsing, searching, querying, and transforming linguistic databases, and for disseminating large linguistic databases over the internet.

Research Question: How can we develop a comprehensive and flexible model for annotated speech databases that handles their multidimensional, heterogeneous, and dynamic nature while also addressing the temporal complexity of the data?

Methodology: The paper describes two database models: the Emu model from Macquarie University, which organizes data primarily by its hierarchical structure, and the annotation graph model from the University of Pennsylvania, which foregrounds the temporal structure. The authors demonstrate that the two models are expressively equivalent.

Results: The research shows that both models can represent annotated speech databases. The annotation graph model is particularly well suited to the temporal complexity of the data. It represents the data as a directed graph in which nodes mark points in time (optionally carrying a time offset into the signal) and labelled edges carry the annotations that span those points.

Implications: The research provides a flexible framework for managing and querying annotated speech databases in linguistics and natural language processing. The framework can support applications such as automatic tagging and parsing, machine translation, and information retrieval. The open-source tools developed as part of this research can also help share and disseminate large linguistic databases across the internet.
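The annotation graph representation described under Results can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the paper's implementation: the class names, the `tier` field, the helper methods, and the example timings are all hypothetical. It assumes that nodes are points on the timeline with an optional time offset and that each annotation is a labelled arc between two nodes.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class Node:
    """A point on the timeline; the time offset is optional."""
    node_id: int
    time: Optional[float] = None  # seconds into the signal, if known


@dataclass(frozen=True)
class Arc:
    """A labelled annotation spanning two nodes."""
    start: int
    end: int
    tier: str   # hypothetical tier name, e.g. "word" or "phoneme"
    label: str


@dataclass
class AnnotationGraph:
    nodes: dict = field(default_factory=dict)
    arcs: list = field(default_factory=list)

    def add_node(self, node_id, time=None):
        self.nodes[node_id] = Node(node_id, time)

    def add_arc(self, start, end, tier, label):
        if start not in self.nodes or end not in self.nodes:
            raise KeyError("both endpoints must be added before the arc")
        self.arcs.append(Arc(start, end, tier, label))

    def labels_on(self, tier):
        """Labels of all arcs on one tier, in insertion order."""
        return [a.label for a in self.arcs if a.tier == tier]

    def arcs_within(self, t0, t1):
        """Arcs whose endpoints both carry times inside [t0, t1]."""
        result = []
        for a in self.arcs:
            s, e = self.nodes[a.start].time, self.nodes[a.end].time
            if s is not None and e is not None and t0 <= s and e <= t1:
                result.append(a)
        return result


def build_example():
    """One word arc and three phoneme arcs over the same stretch of signal."""
    g = AnnotationGraph()
    for node_id, t in enumerate([0.00, 0.12, 0.31, 0.45]):  # made-up times
        g.add_node(node_id, t)
    g.add_arc(0, 3, "word", "cat")
    g.add_arc(0, 1, "phoneme", "k")
    g.add_arc(1, 2, "phoneme", "ae")
    g.add_arc(2, 3, "phoneme", "t")
    return g
```

In this sketch the word arc and the three phoneme arcs share the end nodes 0 and 3, so the hierarchical relation between a word and its phonemes is implied by the shared nodes rather than stored as an explicit parent link.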
In conclusion, the paper presents a comprehensive and flexible model for annotated speech databases that handles their multidimensional, heterogeneous, and dynamic nature as well as their temporal complexity. The resulting tools can support the sharing and dissemination of large linguistic databases.

Link to Article: https://arxiv.org/abs/0204026v1

Authors:

arXiv ID: 0204026v1

[[Category:Computer Science]]
[[Category:Databases]]
[[Category:Research]]
[[Category:Model]]
[[Category:Can]]
[[Category:Annotated]]