Vascular plants dataset of the herbarium (HSS) of Agrarian Research Institute Finca “La Orden-Valdesequera” (CICYTEX), Extremadura, Spain

Abstract The HSS herbarium database includes 69,397 records of vascular plant taxa, representing 91.1% of the herbarium’s specimens as for December, 2019, which are available through the Global Biodiversity Information Facility (GBIF) website (accessible at https://doi.org/10.15468/siye1z). The database represents 4,343 species and 787 infraspecific taxa (530 subspecies, 130 varieties and 127 notho-species or hybrids) of 196 families and 1,164 genera, and 105 type sheets. So far, 97.7% of the databased records are georeferenced (geographic coordinates or MRGS coordinates) and the geographic area with the largest number of specimens is the southwest quadrant of the Iberian Peninsula (Spain and Portugal).

(Badajoz, Extremadura, Spain). Later, 1995, the HSS herbarium was transferred to its current location, in the Finca La Orden (Guadajira, Badajoz), Institute of Agricultural Research Finca La Orden-Valdesequera of the Centre for Scientific and Technological Research of Extremadura (CICYTEX, Junta de Extremadura).
The vascular plant collection of the HSS herbarium is the result of research carried out over the last 25 years by the Department of Forest Production and Biodiversity. This research included studies on plant diversity Vila-Viçosa et al. 2014) in the predominant forest systems of the southwest Iberian Peninsula, taxonomy, plants of ethnobotanical interest and their potential use as new crops, and ecosystem conservation centered on the study of endemic, rare or threatened species and the impact of potentially invasive species (Pinto-Gomes et al. 2006;Blanco and Vázquez 2014;Vázquez et al. 2015).
The main aim of this paper is to provide a vision about the specimens conserved in the HSS herbarium (diversity, distribution and types), and its potential uses in taxonomical, chorological and ecological studies.

Taxonomic coverage
The HSS herbarium collection database contains 69,397 records belonging to 196 families, 1,164 genera, 4,343 species and 787 infraspecific taxa (530 subspecies, 130 varieties, and 127 notho-species or hybrids). Of the specimens in the collection 98.5% are identified at species level.

Geographical coverage
The geographical data are structured in the specimens on three levels: geographical coordinates (longitude, latitude, WGS84 datum), MGRS (Military Grid Reference System) coordinates with an accuracy of 10,000 or 1,000 meters (ETRS89 datum) and locality assignment indicating the continent, country, province, municipality and town.
The HSS herbarium database only includes 60 records without continent or country data, 244 records without province data and 466 records without municipality or town data. 40% of the records are georeferenced with geographic coordinates and MRGS coordinates with precision of 1,000 meters, 57.7% of the records are georeferenced with MRGS coordinate assignment with precision of 10.000 meters, and only 2.3% of the records lack coordinates.   The geographical distribution of the materials preserved in the HSS herbarium is concentrated in the European continent (67,914 specimens, 97.8%), with small collections from other continents: Africa (959 specimens, 1.3%), North America (432 specimens, 0.6%), South America (47 specimens, 0.07%), Asia (25 specimens, 0.04%), and Oceania (20 specimens, 0.03%).
In addition, there is notable representation from North Africa [Morocco (748 records) and Tunisia (209 records)], linked to collection trips for the study of the flora of the Atlas Mountains, being important in the description of new species and subspecies (Vázquez and Devesa 1997;Vázquez and Ramos 2007;Vázquez et al. 2012).

Temporal coverage
The HSS herbarium database includes specimens from 1906 to 2019, distributed in two periods: before 1990 the collection was growing from exchanges and donations from private herbaria (1,353 specimens), and after 1990 it was growing due to floristics and research activities (67,813 specimens). Finally, there are 231 specimens without collection data.
Between 1990 and 2019, the period of greatest activity and growth in the collection are the five-year period 2006-2010, with collections of more than 3,500 specimens per year, linked to the study of the unique and threatened flora and the state of conservation of the predominant habitats in Extremadura (Palacios et al. 2010) (Fig. 4).
The monthly distribution of the specimens preserved in the HSS herbarium shows that the months with the highest collection activity are March, April, May and June corresponding to spring in the northern hemisphere.

Plant processing procedures
The methodology used at the HSS herbarium for preserving specimens involves pressing and drying the fresh materials. To do this, the fresh material is placed between sheets of blotting paper and thick cardboard, including a sheet of corrugated aluminum foil for every 10-15 specimens, to facilitate drying (Singh and Subramaniam 2008). Drying is done at room temperature, with hot air used in exceptional cases. After pressing and drying, the material is subjected to freezing (-40 °C) for 48 hours to facilitate the conservation of the material, thus avoiding insect or fungal attack. This process (adapted from Forman and Bridson 1998) is repeated every 8-12 months until the specimens are accessioned.

Quality control
1. Classification and identification phase. For the inclusion of plant material in the HSS herbarium, taxonomic identification is required, including assignment to genus, species, or to an infraspecific level where necessary. Works used in the identification process are referenced. For peninsular flora, the works of Amaral (1971Amaral ( -2003, Castroviejo (1986-2019), Devesa (1995), Tutin et al. (1964Tutin et al. ( -1980, Valdés et al. (1987) are used. In addition, the determination of synonyms and taxonomic authors are con- 2. Georeferencing process. Approximately 98% of the specimens preserved in the HSS herbarium contain information regarding coordinates (UTM or geographic), with different levels of accuracy. There are 40% (27,761 specimens) with geographic coordinates and a level of uncertainty lower than 100 m. The working methodology followed for the georeferencing of the new data has been based on the collection of geographical coordinates (WGS84) with GPS in the field of sampling and collection points. Subsequently, at the laboratory, using web geoportals (https://www.ign.es/iber-pix2/visor/, for Spain and http://geoportal.lneg.pt/geoportal/mapas/index.html for Portugal), province, municipality, town and altitude are assigned to each new record. For older specimens which did not have coordinates recorded in the field, the GEOLocate application (Rios and Bart 2010), and web geoportals, previously indicated, are used to assign geographic coordinates. Finally, with the help of geographic information systems such as QGIS (https://qgis.org/en/site/) and shapefile layers of MGRS coordinates (precise to 1 km 2 ) obtained from the National Geospatial-Intelligence Agency (USA) (https://earth-info.nga.mil/), the UTM1x1 coordinates of the collection point are assigned.
3. Computerization and web publication (GBIF). The database has Access support following Darwin Core standards (https://dwc.tdwg.org/) on biological biodiversity, and the data are periodically reviewed using OpenRefine (https://openrefine.org/). For the publication of the HSS database in the GBIF portal, the Integrated Publishing Toolkit (IPT) portal of the Spanish GBIF network (https://ipt.gbif.es/resource?r=hss) is used.

Storage
The herbarium storage room is equipped with humidity and temperature control (30% humidity and 10 °C temperature), hermetically sealed mobile shelves and cardboard boxes. The specimens are organized into four groups (ferns, conifers, angiospermsmonocots and other angiosperms). Within each group specimens are ordered alphabetically following the sequence: families, genera, species, subspecies, varieties, and forms.
Regarding the flora of Extremadura, the herbarium holds specimens of 2986 taxa and 113 hybrids, approximately 98% of the taxa known from Extremadura (Castroviejo 1986(Castroviejo -2019Devesa 1995). In addition, the areas with the densest plant collections are located in protected areas with the highest plant diversity and best preserved in the region. These areas include the southern slope of the Gredos Mountain range, Gata Mountain range, Villuercas Mountain range, Badajoz Mountain range, the foothills of Sierra Morena, and Guadiana River valley (Fig. 5).
The HSS collection of vascular plants has served as the basis for various scientific works on the flora and vegetation of Extremadura and the bordering territories of Spain and Portugal. Among them are: the reviews and studies of the family Orchidaceae (Vázquez and Ramos 2005;Vázquez 2007Vázquez , 2008aVázquez , 2008bVázquez , 2009Vázquez et al. 2012), the genus Quercus (Vázquez et al. 1993(Vázquez et al. , 2004Vázquez 1995;Vázquez and Coombes 2016), Stipa L. (Vázquez and Devesa 1997;Vázquez and Ramos 2007), Narcissus Vázquez 2013), Bromus (Vázquez and Scholz 2008), Thymus (Blanco et al. 2007), Thymbra L. (Blanco et al. 2007), Scolymus Tourn ex L. (Vázquez 2000), Typha Tourn ex L. (Vázquez 2012), Taraxacum F.H.Wigg (Vázquez 2014), Festuca L. (Vázquez and García 2016), Vitis L. , and Callitriche L. (Márquez et al. 2017) In addition, the HSS herbarium actively participates in the "Flora Iberica" project by providing material for the study of various genera of the Iberian Peninsula and Balearic Islands. The work carried out in the HSS herbarium allowed for the creation, in 2007, of the scientific journal "Folia Botanica Extremadurensis" a journal dedicated to scientific works and studies on the flora and vegetation of the southwest Iberian Peninsula.  (1)], Fagaceae (15 type sheets, genus Quercus), and Amaryllidaceae (10 type sheets, genus Narcissus). All type sheets in the HSS herbarium are from Spain (89 type sheets, 77 of them from Extremadura), Portugal (10 type sheets), Morocco (5 type sheets), and Tunisia (1 type sheets).

Maintenance and future work
Currently, the HSS herbarium has more than 6,000 sheets, corresponding to the collection trips of 2018-2019, which are not included in the database. The most immediate work is focused updating the database to include these records, and its subsequent updating in the GBIF network. Digitisation of the collection is currently prioritized as well.
Finally, the maintenance of the collection represents processing between 2,000-3,000 specimens annually.