Language Resources and Chemical Informatics

C.J. Rupp, Ann Copestake, Peter Corbett, Peter Murray-Rust, Advaith Siddharthan, Simone Teufel, Benjamin Waldron

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Chemistry research papers are a primary source of information about chemistry, as in any scientic eld. The presentation of the data is, predominantly, unstructured information, and so not immediately susceptible to processes developed within chemical informatics for carrying out chemistry research by information processing techniques. At one level, extracting the relevant information from research papers is a text mining task, requiring both extensive language resources and specialised knowledge of the subject domain. However,
the papers also encode information about the way the research is conducted and the structure of the eld itself. Applying language technology to research papers in chemistry can facilitate eScience on several different levels. The SciBorg project sets out to provide an extensive, analysed corpus of published chemistry research. This relies on the cooperation of several journal publishers to provide papers in an appropriate form. The work is carried out as a collaboration involving the Computer Laboratory, Chemistry Department and eScience Centre at Cambridge University, and is funded under the UK eScience programme.
Original languageEnglish
Title of host publicationProceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008)
Place of PublicationParis, France
PublisherELDA
Pages2196-2200
Publication statusPublished - 2008
Event6th International Conference on Language Resources and Evaluation (LREC'2008) - Marrakesh, Morocco
Duration: 28 May 200830 May 2008

Conference

Conference6th International Conference on Language Resources and Evaluation (LREC'2008)
CountryMorocco
CityMarrakesh
Period28/05/0830/05/08

Cite this

Rupp, C. J., Copestake, A., Corbett, P., Murray-Rust, P., Siddharthan, A., Teufel, S., & Waldron, B. (2008). Language Resources and Chemical Informatics. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008) (pp. 2196-2200). Paris, France: ELDA.

Language Resources and Chemical Informatics. / Rupp, C.J.; Copestake, Ann; Corbett, Peter; Murray-Rust, Peter; Siddharthan, Advaith; Teufel, Simone; Waldron, Benjamin.

Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008). Paris, France : ELDA, 2008. p. 2196-2200.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Rupp, CJ, Copestake, A, Corbett, P, Murray-Rust, P, Siddharthan, A, Teufel, S & Waldron, B 2008, Language Resources and Chemical Informatics. in Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008). ELDA, Paris, France, pp. 2196-2200, 6th International Conference on Language Resources and Evaluation (LREC'2008), Marrakesh, Morocco, 28/05/08.
Rupp CJ, Copestake A, Corbett P, Murray-Rust P, Siddharthan A, Teufel S et al. Language Resources and Chemical Informatics. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008). Paris, France: ELDA. 2008. p. 2196-2200
Rupp, C.J. ; Copestake, Ann ; Corbett, Peter ; Murray-Rust, Peter ; Siddharthan, Advaith ; Teufel, Simone ; Waldron, Benjamin. / Language Resources and Chemical Informatics. Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008). Paris, France : ELDA, 2008. pp. 2196-2200
@inproceedings{4b45b4b712304898ab7be8d7432e2e5b,
title = "Language Resources and Chemical Informatics",
abstract = "Chemistry research papers are a primary source of information about chemistry, as in any scientic eld. The presentation of the data is, predominantly, unstructured information, and so not immediately susceptible to processes developed within chemical informatics for carrying out chemistry research by information processing techniques. At one level, extracting the relevant information from research papers is a text mining task, requiring both extensive language resources and specialised knowledge of the subject domain. However, the papers also encode information about the way the research is conducted and the structure of the eld itself. Applying language technology to research papers in chemistry can facilitate eScience on several different levels. The SciBorg project sets out to provide an extensive, analysed corpus of published chemistry research. This relies on the cooperation of several journal publishers to provide papers in an appropriate form. The work is carried out as a collaboration involving the Computer Laboratory, Chemistry Department and eScience Centre at Cambridge University, and is funded under the UK eScience programme.",
author = "C.J. Rupp and Ann Copestake and Peter Corbett and Peter Murray-Rust and Advaith Siddharthan and Simone Teufel and Benjamin Waldron",
year = "2008",
language = "English",
pages = "2196--2200",
booktitle = "Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008)",
publisher = "ELDA",

}

TY - GEN

T1 - Language Resources and Chemical Informatics

AU - Rupp, C.J.

AU - Copestake, Ann

AU - Corbett, Peter

AU - Murray-Rust, Peter

AU - Siddharthan, Advaith

AU - Teufel, Simone

AU - Waldron, Benjamin

PY - 2008

Y1 - 2008

N2 - Chemistry research papers are a primary source of information about chemistry, as in any scientic eld. The presentation of the data is, predominantly, unstructured information, and so not immediately susceptible to processes developed within chemical informatics for carrying out chemistry research by information processing techniques. At one level, extracting the relevant information from research papers is a text mining task, requiring both extensive language resources and specialised knowledge of the subject domain. However, the papers also encode information about the way the research is conducted and the structure of the eld itself. Applying language technology to research papers in chemistry can facilitate eScience on several different levels. The SciBorg project sets out to provide an extensive, analysed corpus of published chemistry research. This relies on the cooperation of several journal publishers to provide papers in an appropriate form. The work is carried out as a collaboration involving the Computer Laboratory, Chemistry Department and eScience Centre at Cambridge University, and is funded under the UK eScience programme.

AB - Chemistry research papers are a primary source of information about chemistry, as in any scientic eld. The presentation of the data is, predominantly, unstructured information, and so not immediately susceptible to processes developed within chemical informatics for carrying out chemistry research by information processing techniques. At one level, extracting the relevant information from research papers is a text mining task, requiring both extensive language resources and specialised knowledge of the subject domain. However, the papers also encode information about the way the research is conducted and the structure of the eld itself. Applying language technology to research papers in chemistry can facilitate eScience on several different levels. The SciBorg project sets out to provide an extensive, analysed corpus of published chemistry research. This relies on the cooperation of several journal publishers to provide papers in an appropriate form. The work is carried out as a collaboration involving the Computer Laboratory, Chemistry Department and eScience Centre at Cambridge University, and is funded under the UK eScience programme.

M3 - Conference contribution

SP - 2196

EP - 2200

BT - Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008)

PB - ELDA

CY - Paris, France

ER -