The Bug Catalog of the Maven Ecosystem

by Mitropoulos, Dimitris and Karakoidas, Vassilios and Louridas, Panos and Gousios, Georgios and Spinellis, Diomidis

You can get a pre-print version from here.
You can view the publisher's page here.

Abstract

Examining software ecosystems can provide the research community with data regarding artifacts, processes, and communities. We present a dataset obtained from the Maven central repository ecosystem (approximately 265GB of data) by statically analyzing the repository to detect potential software bugs. For our analysis we used FindBugs, a tool that examines Java bytecode to detect numerous types of bugs. The dataset contains the metrics results that Find- Bugs reports for every project version (a jar) included in the ecosystem. For every version we also stored specific metadata such as the jar’s size, its dependencies and others. Our dataset can be used to produce interesting research results, as we show in specific examples.

Bibtex record

@inproceedings{MKLGS14,
  author = {Mitropoulos, Dimitris and Karakoidas, Vassilios and Louridas, Panos and Gousios, Georgios and Spinellis, Diomidis},
  title = {The Bug Catalog of the Maven Ecosystem},
  booktitle = {Proceedings of the 11th Working Conference on Mining Software Repositories},
  series = {MSR 2014},
  year = {2014},
  isbn = {978-1-4503-2863-0},
  location = {Hyderabad, India},
  pages = {372--375},
  numpages = {4},
  doi = {10.1145/2597073.2597123},
  acmid = {2597123},
  publisher = {ACM},
  address = {New York, NY, USA},
  keywords = {FindBugs, Maven Repository, Software Bugs},
  url = {/pub/maven-findbugs.pdf}
}

The paper