| CNB - Protein Design Group · The MITRE Corporation · |
|
|
| · CONTACT · |

This meeting had the aim to bring together researchers developing text data mining tools and related language processing methods to manage the information explosion in the biomedical field. It included invited and contributed papers, with a focus on developing shared infrastructure (tools, corpora, ontologies, see for example Natural Language Processing in Biology and challenge evaluations, in the style of the KDD Challenge Cups. It included talks from two related SIGs, BioPathways and BioOnotologies.

Start
End
Speakers/Authors
Affiliation
TITLE
09:00
09:15
Christian Blaschke
CNB, Spain
Welcome
09:15
09:45
Robert Stevens
U. Manchester, UK
Report from BioOntologies SIG
09:45
10:15
Ian Donaldson, Joel Martin, Berry de Bruijn, Christopher W.V. Hogue
U. Toronto, CA
PreBIND and Textomy - Mining the Biomedical Literature for Protein-Protein Interactions Using a Support Vector Machine
10:15
10:45
Andy Fulmer, Jun Xu, Steven Zhao
Procter & Gamble, US
An Overview of Text Mining in the Biology Domain at P&G
10:45
11:00
BREAK
11:00
11:15
George Demetriou, Robert Gaizauskas
U. Sheffield, UK
Corpus Resources for Development and Evaluation of a Biological Text Mining System
11:15
11:30
Yuka Tateisi, Tomoko Ohta, Jin-dong Kim, Huaquing Hong, Su Jian, Jun-ichi Tsujii
CREST, Japan Science and Technology Corporation
The GENIA Corpus: MEDLINE Abstracts Annotated with Linguistic Information
11:30
11:45
Seth Kulick, Mark Liberman, Andrew Schein
U. Pennsylvania, US
Shallow Semantic Annotation of Biomedical Corpora for Information Extraction
11:45
12:00
Discussion about the creation and use of text corpora
12:00
13:00
LUNCH
13:00
13:30
Maria Samsonova
St.Petersburg State Polytechnical University
Processing of the Natural Language Queries to a Relational Database
13:30
13:45
Tony C. Smith, John G. Cleary
U. of Waikato, NZ
Automatically Linking MEDLINE Abstracts to the Gene Ontology
13:45
14:00
Yoshimasa Tsuruoka, Teruyoshi Hishiki, Osamu Ogasawara, Kousaku Okubo
CREST, Japan Science and Technology Corporation
Integration of Diverse Knowledge and Data into Biomedical Knowledge Matrices
14:00
14:15
Eunji Yi, Gary G. Lee, Soo-Jun Park
Pohang U. of Science and Technology, Korea
HMM-based Protein Name Recognition with Edit Distance Using Automatically Annotated Corpus
14:15
14:30
Discussion
14:30
15:00
BREAK
15:00
15:15
Francisco M. Couto, Mario J. Silva, Pedro Coutinho
U of Lisbon, Portugal
Curating Extracted Information Through the Correlation between Structure and Function
15:15
15:30
Rune Linding, Peter O'Hanlon, Ulrich Reincke, Toby Gibson,
EMBL
Profiling and Classification of Scientific Documents with SAS Text Miner
15:30
15:45
Discussion
15:45
16:15
Alex Yeh, Lynette Hirschman, Alex Morgan
MITRE, US
BioCreAtIvE: Entity Extraction
16:15
16:45
Christian Blaschke, Alfonso Valencia
CNB, Spain
BioCreAtIvE: Functional Extraction
16:45
17:30
Summary and discussion

The venue was held together with ISMB03, thus the Text-Mining registration was as well along with the ordinary ISMB03 registration. ISMB conference rates were extended to cover the extra day required to attend this meeting.

Members of the text data mining community were encouraged to participate in this meeting and also to submit short papers for presentation. Special emphasis was placed on the use of text mining techniques in the discovery of highly non-trivial, novel information in biology, including relationships at the molecular, biochemical and cellular levels.
Abstracts in the following areas were submitted:
- Biological discoveries independently supported by other experimental information using text mining.
- Annotated corpora of biological text for the benchmarking of existing method.
- Approaches for database maintenance and integrity using text mining Standardization and evaluation of methods for information extract.
- Standardization and evaluation of methods for information extraction including algorithms (e.g. full parsers) and databases (e.g. portable ontologies).

- Platform presentations only.
- Submit abstracts by May 15, 2002.
- Decisions by June 1, 2002.
- Submissions to Lynette Hirschman .
- Acceptable formats: PDF, postscript, RTF, MS.
- Length: 2-4 page for abstract.
- PLEASE include name, contact details, affiliation.