GenGIS: A geospatial information system for genomic data

  1. Donovan H. Parks1,
  2. Michael Porter1,
  3. Sylvia Churcher1,
  4. Suwen Wang1,
  5. Christian Blouin1,
  6. Jacqueline Whalley2,
  7. Stephen Brooks1 and
  8. Robert Beiko1,3
  1. 1 Dalhousie University;
  2. 2 Auckland University of Technology
  1. * Corresponding author; email: beiko{at}cs.dal.ca

Abstract

The increasing availability of genetic sequence data associated with explicit geographic and ecological information is offering new opportunities to study the processes that shape biodiversity. The generation and testing of hypotheses using these data sets requires effective tools for mathematical and visual analysis which can integrate digital maps, ecological data, and large genetic, genomic or metagenomic data sets. GenGIS is a free and open-source software package that supports the integration of digital map data with genetic sequences and environmental information from multiple sample sites. Essential bioinformatic and statistical tools are integrated into the software, allowing the user a wide range of analysis options for their sequence data. Data visualizations are combined with the cartographic display to yield a clear view of the relationship between geography and genomic diversity, with a particular focus on the hierarchical clustering of sites based on their similarity or phylogenetic proximity. Here we outline the features of GenGIS and demonstrate its application to georeferenced microbial metagenomic, HIV-1, and human mitochondrial DNA data sets.

Footnotes

    • Received May 3, 2009.
    • Accepted July 16, 2009.

Articles citing this article

ACCEPTED MANUSCRIPT

Preprint Server