Content Mining in the News/Literature

“Legal uncertainty” is pointed to as another issue, according to Wellcome Trust policy officer Will Greenacre, who says researchers worry about getting sued. “The right to read is the right to mine” is a mantra of pro-content mining organisations, which argue that since people are legally allowed to copy down facts and notes from articles with a pen and paper to reach conclusions, a computer programme should be allowed to perform the same function. SOURCE

An overview of the issues around content mining being debated in the European Parliament and Commission. It’s gratifying to see our term “content mining” now being widely used, along with our mantra ‘The Right to Read is the Right to Mine’. The piece has also picked up one of Peter Murray-Rust’s webinars and highlighted our experience that ‘anyone can learn TDM in 30 minutes’. SOURCE

Ross Mounce states in this interview: “My current area of research is phyloinformatics, and I’m a postdoc in the Wills group at the University of Bath. I take published evolutionary trees and other evolutionary data from the academic literature and perform meta-analyses and syntheses of this information across hundreds and thousands of papers to gain higher-level insights into evolutionary patterns across different species groups. Just getting these data back into re-usable, re-computable forms from the published literature is by far and away the hardest challenge of our project. As part of the BBSRC-funded PLUTo project (Phyloinformatic Literature Unlocking Tools), I’m working with Peter Murray-Rust and the ContentMine team to develop software tools and approaches to help automate the process of finding and extracting phylogenetic data from the literature”. SOURCE

‘Parallel worlds of public and commercial bioactive chemistry data’ Ekins et al
The availability of structures and linked bioactivity data in databases is powerfully enabling for drug discovery and chemical biology. However, we now review some confounding issues with the divergent expansions of public and commercial sources of chemical structures… The ContentMine initiative extracting molecules from documents could also further emphasize that SciFinder is perhaps no longer the definitive site for chemistry prior art checking. SOURCE

‘Section level search functionality in Europe PMC’ McEntyre et al
As the availability of open access full text research articles increases, so does the need for sophisticated search services that make the most of this new content. Here, we present a new feature available in Europe PMC that allows selected sections of full text articles to be searched, including figures and reference lists. Users can now search particular parts of an article, reducing noise and allowing fine-tuning of searches…. The ContentMine project aims to develop tools that allow extraction of facts from scientific articles and figures. The section tagger is being integrated into this project to add structure to documents before text mining is applied. SOURCE
