CNB - Protein Design Group  · The MITRE Corporation ·

name


 ·  CONTACT  · 


name

This meeting had the aim to bring together researchers developing text data mining tools and related language processing methods to manage the information explosion in the biomedical field. It included invited and contributed papers, with a focus on developing shared infrastructure (tools, corpora, ontologies, see for example Natural Language Processing in Biology and challenge evaluations, in the style of the KDD Challenge Cups. It included talks from two related SIGs, BioPathways and BioOnotologies.

 
 
 

name

The full version of the handouts of the SIG meeting on Text Mining at ISMB2003 in Brisbane (Friday, June 27, 2003) can be downloaded as a Microsoft Word document or accessed as a html page, were links to abstracts and presentations are provided as well.

The time schedule of the meeting programm was the following:

Start

End

Speakers/Authors

Affiliation

TITLE

09:00

09:15

Christian Blaschke

CNB, Spain

Welcome

09:15

09:45

Robert Stevens

U. Manchester, UK

Report from BioOntologies SIG

09:45

10:15

Ian Donaldson, Joel Martin, Berry de Bruijn, Christopher W.V. Hogue

U. Toronto, CA

PreBIND and Textomy - Mining the Biomedical Literature for Protein-Protein Interactions Using a Support Vector Machine

10:15

10:45

Andy Fulmer, Jun Xu, Steven Zhao

Procter & Gamble, US

An Overview of Text Mining in the Biology Domain at P&G

10:45

11:00

BREAK



11:00

11:15

George Demetriou, Robert Gaizauskas

U. Sheffield, UK

Corpus Resources for Development and Evaluation of a Biological Text Mining System

11:15

11:30

Yuka Tateisi, Tomoko Ohta, Jin-dong Kim, Huaquing Hong, Su Jian, Jun-ichi Tsujii

CREST, Japan Science and Technology Corporation

The GENIA Corpus: MEDLINE Abstracts Annotated with Linguistic Information

11:30

11:45

Seth Kulick, Mark Liberman, Andrew Schein

U. Pennsylvania, US

Shallow Semantic Annotation of Biomedical Corpora for Information Extraction

11:45

12:00



Discussion about the creation and use of text corpora

12:00

13:00

LUNCH



13:00

13:30

Maria Samsonova

St.Petersburg State Polytechnical University

Processing of the Natural Language Queries to a Relational Database

13:30

13:45

Tony C. Smith, John G. Cleary

U. of Waikato, NZ

Automatically Linking MEDLINE Abstracts to the Gene Ontology

13:45

14:00

Yoshimasa Tsuruoka, Teruyoshi Hishiki, Osamu Ogasawara, Kousaku Okubo

CREST, Japan Science and Technology Corporation

Integration of Diverse Knowledge and Data into Biomedical Knowledge Matrices

14:00

14:15

Eunji Yi, Gary G. Lee, Soo-Jun Park

Pohang U. of Science and Technology, Korea

HMM-based Protein Name Recognition with Edit Distance Using Automatically Annotated Corpus

14:15

14:30



Discussion

14:30

15:00

BREAK



15:00

15:15

Francisco M. Couto, Mario J. Silva, Pedro Coutinho

U of Lisbon, Portugal

Curating Extracted Information Through the Correlation between Structure and Function

15:15

15:30

Rune Linding, Peter O'Hanlon, Ulrich Reincke, Toby Gibson,

EMBL

Profiling and Classification of Scientific Documents with SAS Text Miner

15:30

15:45



Discussion

15:45

16:15

Alex Yeh, Lynette Hirschman, Alex Morgan

MITRE, US

BioCreAtIvE: Entity Extraction

16:15

16:45

Christian Blaschke, Alfonso Valencia

CNB, Spain

BioCreAtIvE: Functional Extraction

16:45

17:30



Summary and discussion


 
 
 

name

The venue was held together with ISMB03, thus the Text-Mining registration was as well along with the ordinary ISMB03 registration. ISMB conference rates were extended to cover the extra day required to attend this meeting.
 
 
 

name

Members of the text data mining community were encouraged to participate in this meeting and also to submit short papers for presentation. Special emphasis was placed on the use of text mining techniques in the discovery of highly non-trivial, novel information in biology, including relationships at the molecular, biochemical and cellular levels.

Abstracts in the following areas were submitted:
  • Biological discoveries independently supported by other experimental information using text mining.
  • Annotated corpora of biological text for the benchmarking of existing method.
  • Approaches for database maintenance and integrity using text mining Standardization and evaluation of methods for information extract.
  • Standardization and evaluation of methods for information extraction including algorithms (e.g. full parsers) and databases (e.g. portable ontologies).
 
 
 

name

  • Platform presentations only.
  • Submit abstracts by May 15, 2002.
  • Decisions by June 1, 2002.
  • Submissions to Lynette Hirschman .
  • Acceptable formats: PDF, postscript, RTF, MS.
  • Length: 2-4 page for abstract.
  • PLEASE include name, contact details, affiliation.

 
 
 
last update: 2003/07/31
© by Martin Krallinger
martink@cnb.uam.es