Editing
Using Content Models to Improve Information Ordering and Extractive Summarization
Jump to navigation
Jump to search
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
Title: Using Content Models to Improve Information Ordering and Extractive Summarization Abstract: This research investigates the use of content models, based on Hidden Markov Models (HMMs), to improve information ordering and extractive summarization. Content models represent topics and their relationships within a specific domain, allowing for the organization of information in a meaningful way. The study shows that these models outperform existing methods, making them a promising tool for various text processing tasks. Main Research Question: Can content models, based on Hidden Markov Models, be used to improve information ordering and extractive summarization in text processing tasks? Methodology: The study uses a knowledge-lean method to learn content models directly from unannotated documents. These models represent types of information characteristic to the domain, and state transitions capture possible information presentation orderings. The models are then applied to two tasks: information ordering and extractive summarization. Results: The results show that content models outperform existing methods for information ordering by a wide margin. For extractive summarization, a new learning algorithm for sentence selection is developed, resulting in summaries that yield 88% match with human-written output, significantly better than the standard "leading n sentences" baseline. Implications: The success of content models in these two tasks demonstrates their flexibility and effectiveness. This suggests that the formalism can prove useful in a broader range of text processing applications, making it conceptually intuitive and efficiently learnable from raw document collections. Link to Article: https://arxiv.org/abs/0405039v1 Authors: arXiv ID: 0405039v1 [[Category:Computer Science]] [[Category:Models]] [[Category:Information]] [[Category:Content]] [[Category:Ordering]] [[Category:Extractive]]
Summary:
Please note that all contributions to Simple Sci Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Simple Sci Wiki:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Navigation menu
Personal tools
Not logged in
Talk
Contributions
Create account
Log in
Namespaces
Page
Discussion
English
Views
Read
Edit
Edit source
View history
More
Search
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Tools
What links here
Related changes
Special pages
Page information