Harnessing the crowds for automating the identification of Web APIs

Carlos Pedrinaci, Dong Liu, Chenghua Lin, John Domingue

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

Abstract

Supporting the efficient discovery and use of Web APIs is increasingly important as their use and popularity grows. Yet, a simple task like finding potentially interesting APIs and their related documentation turns out to be hard and time consuming even when using the best resources currently available on the Web. In this paper we describe our research towards an automated Web API documentation crawler and search engine. This paper presents two main contributions. First, we have devised and exploited crowdsourcing techniques to generate a curated dataset of Web APIs documentation. Second, thanks to this dataset, we have devised an engine able to automatically detect documentation pages. Our preliminary experiments have shown that we obtain an accuracy of 80% and a precision increase of 15 points over a keyword-based heuristic we have used as baseline.
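
The abstract compares the detection engine against a keyword-based heuristic and reports accuracy and precision. The record does not say how the baseline or the metrics were implemented, so the following is only a minimal sketch of what such a baseline and its evaluation could look like. The keyword list, the threshold, the function names, and the toy pages are all assumptions made for illustration and are not taken from the paper.

```python
# Hypothetical keyword list and threshold; the paper's actual baseline is not
# described in this record.
KEYWORDS = ("api", "rest", "endpoint", "documentation", "developer")


def keyword_baseline(text, keywords=KEYWORDS, threshold=2):
    """Flag a page as API documentation if enough distinct keywords occur.

    Uses plain substring matching, so "rest" would also match "interest";
    a real system would tokenize the text first.
    """
    lowered = text.lower()
    hits = sum(1 for kw in keywords if kw in lowered)
    return hits >= threshold


def accuracy(predicted, gold):
    """Fraction of pages whose predicted label equals the gold label."""
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)


def precision(predicted, gold):
    """Fraction of pages flagged as documentation that really are."""
    true_positives = sum(p and g for p, g in zip(predicted, gold))
    flagged = sum(predicted)
    return true_positives / flagged if flagged else 0.0


# Toy, made-up pages with hand-assigned labels (True = documentation page).
pages = [
    "REST API documentation for developers: endpoint list",
    "Our company blog about cooking",
    "Developer API pricing news",
    "Holiday photos",
]
gold = [True, False, False, False]

predicted = [keyword_baseline(page) for page in pages]
print("accuracy:", accuracy(predicted, gold))
print("precision:", precision(predicted, gold))
```

On this toy data the third page is a false positive: it mentions "developer" and "API" without being documentation. That is the kind of error a classifier trained on a curated dataset would be expected to reduce, which is consistent with the precision gain the abstract reports.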
Original language: English
Title of host publication: Papers from the 2012 AAAI Spring Symposium
Subtitle of host publication: Technical Report SS-12-04
Place of Publication: Palo Alto, California
Publisher: AAAI Press
Pages: 58-63
Number of pages: 6
ISBN (Print): 978-1-57735-553-3
Publication status: Published - 2012

Fingerprint

Application programming interfaces (API)
Search engines
Engines
Experiments

Cite this

Pedrinaci, C., Liu, D., Lin, C., & Domingue, J. (2012). Harnessing the crowds for automating the identification of Web APIs. In Papers from the 2012 AAAI Spring Symposium: Technical Report SS-12-04 (pp. 58-63). Palo Alto, California: AAAI Press.

Harnessing the crowds for automating the identification of Web APIs. / Pedrinaci, Carlos; Liu, Dong; Lin, Chenghua; Domingue, John.

Papers from the 2012 AAAI Spring Symposium: Technical Report SS-12-04. Palo Alto, California: AAAI Press, 2012. p. 58-63.

Pedrinaci, C, Liu, D, Lin, C & Domingue, J 2012, Harnessing the crowds for automating the identification of Web APIs. in Papers from the 2012 AAAI Spring Symposium: Technical Report SS-12-04. AAAI Press, Palo Alto, California, pp. 58-63.
Pedrinaci C, Liu D, Lin C, Domingue J. Harnessing the crowds for automating the identification of Web APIs. In Papers from the 2012 AAAI Spring Symposium: Technical Report SS-12-04. Palo Alto, California: AAAI Press. 2012. p. 58-63
Pedrinaci, Carlos; Liu, Dong; Lin, Chenghua; Domingue, John. / Harnessing the crowds for automating the identification of Web APIs. Papers from the 2012 AAAI Spring Symposium: Technical Report SS-12-04. Palo Alto, California: AAAI Press, 2012. pp. 58-63
@inproceedings{4774e71c10c143a186a2ba705a37016f,
title = "Harnessing the crowds for automating the identification of Web APIs",
abstract = "Supporting the efficient discovery and use of Web APIs is increasingly important as their use and popularity grows. Yet, a simple task like finding potentially interesting APIs and their related documentation turns out to be hard and time consuming even when using the best resources currently available on the Web. In this paper we describe our research towards an automated Web API documentation crawler and search engine. This paper presents two main contributions. First, we have devised and exploited crowdsourcing techniques to generate a curated dataset of Web APIs documentation. Second, thanks to this dataset, we have devised an engine able to automatically detect documentation pages. Our preliminary experiments have shown that we obtain an accuracy of 80{\%} and a precision increase of 15 points over a keyword-based heuristic we have used as baseline.",
author = "Carlos Pedrinaci and Dong Liu and Chenghua Lin and John Domingue",
year = "2012",
language = "English",
isbn = "978-1-57735-553-3",
pages = "58--63",
booktitle = "Papers from the 2012 AAAI Spring Symposium",
publisher = "AAAI Press",

}

TY - GEN

T1 - Harnessing the crowds for automating the identification of Web APIs

AU - Pedrinaci, Carlos

AU - Liu, Dong

AU - Lin, Chenghua

AU - Domingue, John

PY - 2012

Y1 - 2012

N2 - Supporting the efficient discovery and use of Web APIs is increasingly important as their use and popularity grows. Yet, a simple task like finding potentially interesting APIs and their related documentation turns out to be hard and time consuming even when using the best resources currently available on the Web. In this paper we describe our research towards an automated Web API documentation crawler and search engine. This paper presents two main contributions. First, we have devised and exploited crowdsourcing techniques to generate a curated dataset of Web APIs documentation. Second, thanks to this dataset, we have devised an engine able to automatically detect documentation pages. Our preliminary experiments have shown that we obtain an accuracy of 80% and a precision increase of 15 points over a keyword-based heuristic we have used as baseline.

AB - Supporting the efficient discovery and use of Web APIs is increasingly important as their use and popularity grows. Yet, a simple task like finding potentially interesting APIs and their related documentation turns out to be hard and time consuming even when using the best resources currently available on the Web. In this paper we describe our research towards an automated Web API documentation crawler and search engine. This paper presents two main contributions. First, we have devised and exploited crowdsourcing techniques to generate a curated dataset of Web APIs documentation. Second, thanks to this dataset, we have devised an engine able to automatically detect documentation pages. Our preliminary experiments have shown that we obtain an accuracy of 80% and a precision increase of 15 points over a keyword-based heuristic we have used as baseline.

M3 - Conference contribution

SN - 978-1-57735-553-3

SP - 58

EP - 63

BT - Papers from the 2012 AAAI Spring Symposium

PB - AAAI Press

CY - Palo Alto, California

ER -