TY - JOUR A1 - Kasprzyk, Arek A1 - Keefe, Damian A1 - Smedley, Damian A1 - London, Darin A1 - Spooner, William A1 - Melsopp, Craig A1 - Hammond, Martin A1 - Rocca-Serra, Philippe A1 - Cox, Tony A1 - Birney, Ewan T1 - EnsMart: A Generic System for Fast and Flexible Access to Biological Data Y1 - 2004/01/01 JF - Genome Research JO - Genome Research SP - 160 EP - 169 DO - 10.1101/gr.1645104 VL - 14 IS - 1 UR - http://genome.cshlp.org/content/14/1/160.abstract N2 - The EnsMart system (www.ensembl.org/EnsMart) provides a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools. The system consists of a query-optimized database and interactive, user-friendly interfaces. EnsMart has been applied to Ensembl, where it extends its genomic browser capabilities, facilitating rapid retrieval of customized data sets. A wide variety of complex queries, on various types of annotations, for numerous species are supported. These can be applied to many research problems, ranging from SNP selection for candidate gene screening, through cross-species evolutionary comparisons, to microarray annotation. Users can group and refine biological data according to many criteria, including cross-species analyses, disease links, sequence variations, and expression patterns. Both tabulated list data and biological sequence output can be generated dynamically, in HTML, text, Microsoft Excel, and compressed formats. A wide range of sequence types, such as cDNA, peptides, coding regions, UTRs, and exons, with additional upstream and downstream regions, can be retrieved. The EnsMart database can be accessed via a public Web site, or through a Java application suite. Both implementations and the database are freely available for local installation, and can be extended or adapted to `non-Ensembl' data sets. ER -