Introduction
In the last three decades, a large number of
taxonomic databases have been developed to
address curatorial management processes,
taxonomic revisions and applied biology needs
(Pankhurst, 1991) as well as the growing demand
for large-scale global biodiversity information
systems accessible over the World Wide Web
(Bisby, 2000; Curry & Humphries, 2007).
A basic step towards developing a taxonomic
database system is to build a data model to
describe the entities involved and relationships
among them. This has lead to the development of
several models for the design of taxonomic
databases (Allkin & Bisby, 1988; Allkin & White,
1982, 1988; Berendsohn, 1997; Berendsohn et al.,
1999). Although no doubt useful in clarifying
relationships among taxonomic entities and their
attributes, these models are, however, invariably
of such complexity as to make them rather
difficult to implement and manage (Morris, 2005).
More recently, this approach has been
expanded towards the development of
comprehensive solutions for taxonomic
computing, as the Scratchpad (Smith et al., 2009)
and the EDIT Platform for Cybertaxonomy
(Berendsohn, 2010; Berendsohn et al., 2011). The
former is built on top of the generic content
management system Drupal, whereas the latter
comprises both a common data model and a set of
specialized software tools for interacting with it.
These solutions, however, still do not offer the
average user an independent environment, at the
same time simple to use and to maintain, for
dealing with the complexity of taxonomic data, as
nomenclatural information, the taxonomic
hierarchy, structured and unstructured descriptive
data, geographic information, literature citations,
and ecological and genomic data.
The ACACIA design is an attempt to
overcome this difficulty, offering a simple but
flexible and extensible data model that can be
used as a framework for taxonomic information
systems. ACACIA intends to be a practical
implementation of the ideal of a "universal
biological database structure" (White & Allkin,
1993).
The name "Acacia" is an allusion to "Baobab",
a comprehensive database design for biologists
developed by Allkin & White (1982, 1988) and
partially implemented in the Alice species
diversity database management system (White &
Allkin, 1993; White et al., 1993). The ACACIA
model can be conceived of as a simpler version of
the full BAOBAB design. The genus Acacia has
also been frequently used as an example in
taxonomic database studies (Allkin et al., 1992).
Materials and Methods
As a database scheme, ACACIA is a set of 15
entities or tables based on the relational database
model (Codd, 1970), designed to convey all basic
classes of information required for taxonomic
databases and facilitate the recording of
taxonomic data from literature and other sources
(biological collections, field surveys, and other
databases). Four tables in the set are mandatory in
any ACACIA database: species (storing valid, ie.,
currently accepted, species names), synonyms