Building a protein name dictionary from full text: a machine learning term extraction approach

Background
Most of the information in the biological literature resides in full text articles rather than in abstracts. Yet abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools also rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants that are used to refer to biological entities in the literature.

Results
We present an approach to recognize named entities in full text. The approach collects high-frequency terms in an article and uses support vector machines (SVMs) to identify which of them are biological entity names. It is computationally efficient and robust to the noise commonly found in full text material. We use the method to create a protein name dictionary from a set of 80,528 full text articles. Only 8.3% of the names in this dictionary match SwissProt description lines. We assess the quality of the dictionary by studying its protein name recognition performance in full text.
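
As an illustration of the first step only, the sketch below collects high-frequency candidate terms from a single article. The tokenizer, the n-gram length limit, the frequency threshold, and the function name `frequent_terms` are all assumptions made for this example; the abstract does not specify them. The SVM classification step that would follow is not shown.

```python
import re
from collections import Counter


def frequent_terms(text, max_words=3, min_count=3):
    """Return word n-grams (1..max_words words) seen at least min_count times.

    Hypothetical helper: the thresholds and tokenization are illustrative
    assumptions, not values taken from the paper.
    """
    # Simple tokenizer: alphanumeric runs, allowing internal hyphens.
    tokens = re.findall(r"[A-Za-z0-9][A-Za-z0-9\-]*", text)
    counts = Counter()
    for n in range(1, max_words + 1):
        # Note: n-grams may span sentence boundaries in this simplified version.
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return {term: c for term, c in counts.items() if c >= min_count}


article = (
    "Cdc42 binds PAK1. Cdc42 activates PAK1 kinase. "
    "The PAK1 kinase is regulated by Cdc42."
)
candidates = frequent_terms(article, max_words=2, min_count=2)
print(candidates)
```

In a pipeline like the one described, candidates such as these would be turned into feature vectors and passed to a trained classifier that decides which ones are protein names.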
Conclusions
This dictionary term lookup method compares favorably to other published methods, supporting the significance of our direct extraction approach. The method is particularly good at recognizing name variants not found in SwissProt.
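
A dictionary term lookup of this kind could be sketched as follows. The abstract does not describe the matching procedure, so the greedy longest-match strategy, the case-insensitive comparison, and the name `lookup_names` are assumptions chosen for illustration.

```python
def lookup_names(tokens, dictionary, max_words=4):
    """Greedy longest-match lookup of dictionary names in a token list.

    Hypothetical helper: returns (start, end, name) tuples, where end is
    exclusive. The dictionary is assumed to hold lowercase names.
    """
    hits = []
    i = 0
    while i < len(tokens):
        match = None
        # Try the longest candidate first, then shrink.
        for n in range(min(max_words, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n]).lower()
            if candidate in dictionary:
                match = (i, i + n, candidate)
                break
        if match:
            hits.append(match)
            i = match[1]  # continue after the matched name
        else:
            i += 1
    return hits


dictionary = {"pak1", "pak1 kinase", "cdc42"}
tokens = "Cdc42 activates PAK1 kinase in vitro".split()
hits = lookup_names(tokens, dictionary)
print(hits)
```

Preferring the longest match keeps "PAK1 kinase" as a single hit instead of reporting the shorter entry "PAK1" inside it.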



Source: BMC Bioinformatics

