High-frequency hydrodynamic and hydrochemical data from karst unsaturated zone
Leïla Serène, Guillaume Cinkus, Naomi Mazzilli, Jean-Baptiste Decitre, Christophe Emblanch, Franck Tison, Julien Dupont, Milanka Babic, Roland Simler, Matthieu Blanc

TL;DR
This paper presents a unique dataset of high-frequency hydrological and weather data from the unsaturated zone of a karst system in France, offering insights into water flow and climate interactions.
Contribution
The study provides a rare, continuous dataset of unsaturated zone flows in a karst system, including slow flows typically difficult to monitor.
Findings
The dataset includes electrical conductivity, temperature, discharge, and weather parameters from July 2022 to April 2024.
Data from slow flows in limestone matrix and various unsaturated zone flows are captured, which are rarely monitored.
The dataset can be used to study pressure and mass transfer in karst systems and improve climate models.
Abstract
This dataset provides hydrological continuous data for 12 unsaturated zone flows of the Fontaine the Vaucluse karst system (France). These flows are caused by faults, fractures, karst conduits, and slow-flows from porous rock. This panel of unsaturated zone flow is unique. The dataset includes electrical conductivity, temperature and discharge, obtained through the artificial galleries of the Low Noise Underground Laboratory (LSBB, https://lsbb.cnrs.fr), and providing a unique window into the hydrological processes occurring in the unsaturated zone of karst aquifers. It also provides air pressure, temperature and humidity inside the galleries and weather parameters at the surface above the galleries (pluviometry, temperature, solar radiation, humidity, wind speed, wind direction, barometric pressure). The data are provided at an hourly time step from July 2022 to April 2024 and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsKarst Systems and Hydrogeology · Hydrocarbon exploration and reservoir analysis · Groundwater flow and contamination studies
Specifications TableSubjectEnvironmental ScienceSpecific subject areaHydrodynamic and hydrochemistry data in hydrogeology, weather data in climatology, karst hydrogeology, karst spring, unsaturated zoneType of dataTable, filtered dataData collectionThe Low Noise Underground Laboratory (LSBB) has 3 galleries allowing the monitoring of Fontaine de Vaucluse karst’s unsaturated zone flows (electrical conductivity, temperature (C4E probe, Ponsel), discharge with homemade flowmeter (plexiglass tube and PFT pressure sensor, Sick)). Air pressure, temperature and humidity inside the galleries is monitored (Yocto-meteo sensor, Yoctopuce). Surface weather (pluviometry, temperature, solar radiation, humidity, wind speed, wind direction, and barometric pressure) is monitored by a weather station (MetPak Pro, integrated Windsonic, Gill Instruments). Hourly data (Jul 2022-Apr 2024) was checked using manual measurements and outliers removed.Data source locationWhere the data are collected:
- •Institution : Laboratoire LSBB UAR3538 CNRS
- •City/Town/Region: Rustrel, 84400
- •Region: Provence-Alpes-Côtes d’Azur
- •Country: France Geographical coordinated of the LSBB’s galleries entrance:Latitude : 43°55′43.11"NLongitude : 5°29′14.36"EData accessibilityRepository name: OSURISData identification number & direct URL to data:
- •Groundwater data from LSBB’s flows [1]:https://doi.org/10.26169/hplus.LSBB_water_high-frequency_monitoring
- •Data from the weather on the surface above the galleries and from the air inside the galleries [2]:https://doi.org/10.26169/hplus.LSBB_air_weather_high-frequency_monitoring Instructions for accessing these data:Data can be downloaded from the "Download Links" box at the bottom of each web page accessible by clicking on the links above.
Value of the Data
1
- •Karst unsaturated zone flows are usually instrumented via speleological access (e.g. caves), which constrains both the types and number of monitored points. Speleological access is a major difficulty for sampling in unsaturated zones, as it involves longer access times and higher costs due to the difficulties of access, often requiring the presence of specialized technical personnel (speleologists). It also involves a limited exploration extent. Here, underground access is provided by a 4-long artificial gallery (the LSBB), which allows the collection of data from a wide range of flow conditions, illustrating the exceptional diversity and complexity of unsaturated zone flows. No other data articles on unsaturated zone flow could be found, which highlights the uniqueness of these data.
- •Monitored parameters allow the assessment of both pressure (thanks to pluviometry and flow temperature and discharge) and mass transfers (thanks to pluviometry and electrical conductivity) within the unsaturated zone. This can be useful for improving understanding of unsaturated zone flows, thanks to modeling, time series analysis or by developing a typology or classification of these kinds of flows that could be transferred to other karst systems having less accessible unsaturated zones.Atmospheric parameters can be used to improve the robustness of climate models by having an additional point, as the nearest weather station is 10 km away. These data can also be useful for long-term climate trend analysis, as this station is designed to be permanent.
- •The LSBB host numerous scientific activities from a wide range of research fields such as astrophysics and geosciences. Temporal and spatial variations in water content in the massif constitute part of the noise in the observations made and must therefore be characterized, which is what our dataset is contributing to.
Background
2
This dataset was generated to study the behavior of unsaturated zone flows in karst areas in order to improve the knowledge of karst aquifers recharge. The Vaucluse karst system (southeastern France) was chosen because of its particularly thick unsaturated zone (around 800 m) which helps to maintain an important discharge of its main outlet, even during dry periods [3]. In the south of the catchment, the unsaturated zone is randomly intersected by the artificial galleries of the LSBB (https://lsbb.cnrs.fr, Fig. 1), which allow the observation of three to more than fifty unsaturated zone flows depending on the hydrodynamic conditions. These flows are located at depth below surface ranging from ∼30 m to almost 500 m. Since 2002, these flows are point-in-time monitored (major elements, stable isotopes, TOC, physicochemical parameters, discharge) at a weekly to monthly time step. This monitoring frequency is too large to fully assess the behavior of these unsaturated zone flows. For this reason, we instrumented with continuous monitoring stations for temperature, electrical conductivity and discharge the 12 most frequently activated flows. To compare water outputs with inputs, data can be used from the weather station installed on the surface above the LSBB’s galleries, in order to measure pluviometry.Fig. 1(a) Fontaine de Vaucluse karst system catchment, major outlet and LSBB location on background of 1:50000 geological map (BD-CHARM from BRGM). Catchment delineation of Fontaine de Vaucluse was taken from [7]; (b) Position of the measuring stations on an enlarged view of the LSBB on background of a structural map (modified from [8]).Fig 1
Data Description
3
Location of the Measuring Stations
3.1
Study area is located in the Vaucluse karst system in southeastern France, inside and above the artificial galleries of the LSBB (https://lsbb.cnrs.fr, Fig. 1). The Vaucluse karst system is part of the French National Observatory of Karst (SNO Karst, https://sokarst.org/) and is characterized by a thick unsaturated zone of around 800 m [4]. LSBB galleries are dug in the unsaturated zone of the regional size aquifer, providing access to between 3 and more than 50 simultaneous flows. These flows are caused by faults, fractures and karst conduits, and slow-flows from porous rock. This panel of flow provides a unique window into the hydrological processes occurring in the unsaturated zone of karst aquifers. The 12 most frequently observed flows inside these galleries are instrumented and a weather station is installed at the surface, above flow D (Fig. 1b). In parallel, point-in-time measurement and water sampling are performed at least once a month from 2002 until now [5,6].
Available Data
3.2
Data consist in electrical conductivity, temperature and discharge of the 12 most frequently observed flows (A, B, C, D, W, AJ, T, GAS, AC, AY, I, and J; location Fig. 1b, Table 1); air temperature, pressure and humidity inside the galleries near flows D, W, AJ, J and I; and barometric pressure, solar radiation, humidity, temperature, wind speed, wind direction and pluviometry at the surface above the galleries (meteo station Fig. 1b). Data are available at an hourly time step from July 2022 to April 2024 and will be updated as stations continues to monitor. As most of these 12 flows are temporary, measurements are not available for the whole period (Fig. 2). There are also some data gaps for permanent flows (A–D), which correspond to temporary technical problems.Table 1. Latitude, longitude and depth of the monitoring station inside and above LSBB’s galleries.Table 1. StationLatitudeLongitudeDepth from surface (m)D43°56′11.70"N5°27′57.79"E33.3C43°56′22.69"N5°28′22.67"E256B43°56′27.71"N5°28′34.11"E421.5A43°56′28.94"N5°28′36.93"E442.2W43°56′32.00"N5°28′52.69"E437.4AJ43°56′13.97"N5°29′3.46"E359T43°56′1.00"N5°29′4.08"E195.7AC43°56′0.62"N5°29′1.31"E183.5GAS43°56′1.33"N5°29′0.93"E191.7AY43°55′51.92"N5°29′3.10"E104.1I43°55′46.85"N5°29′5.63"E60.2J43°55′46.83"N5°29′4.27"E58.2Meteo43°56′10.84"N5°27′56.53"E0Fig. 2Data availability chart (representation at a daily time step).Fig 2
Location and Structure of the Data
3.3
The data can be accessed directly by clicking on the following links, which takes the reader to a web page containing the title, dataset description, acknowledgements, funding and contributors:
- •Groundwater data from LSBB’s flows [1]:https://doi.org/10.26169/hplus.LSBB_water_high-frequency_monitoring
- •Weather data from the surface above the galleries and from the air inside the galleries [2]:https://doi.org/10.26169/hplus.LSBB_air_weather_high-frequency_monitoring
These web pages also contain a box called “Download links” which allows to download various files containing the data in .zip format. Each of the fourth .zip files contains a PDF file with the conditions of use of the data and a .csv file with the data. Table 2 summarises in which .csv file the different parameters are stored, and where the .csv files can be downloaded.Table 2. Location of the .csv files containing the parameters.Table 2. Type of Data and linkFile nameParameter namesUnitWeather data from the surface above the galleries and from the air inside the galleries:https://doi.org/10.26169/hplus.LSBB_air_weather_high-frequency_monitoringLSBB_air_probe_2022-2024.csvmean air temperature°Cmean barometric pressurembarmean humidity%LSBB_weather_data_buiss_2022-2024.csvglobal solar radiationW/m²mean barometric pressurembarmean humidity*%mean shelter temperature°Cmean wind speedm/spluviometrymmGroundwater data from LSBB’s flows:*https://doi.org/10.26169/hplus.LSBB_water_high-frequency_monitoring*Lsbb_ecoulements_tunnel_2022-2024.csvflowratem3/sLSBB_multiparameter_probe_station_2022-2024.csvconductivityµS/cmtemperature°C*
The files are organized to be easily managed with programming tools such as Python and R studio as the parameters are listed in a unique column called “Name of Parameter”. The 4 files contain the following columns:
-
Name of the site associated to the station: it is always “lsbb” to refer to the Low-Noise Underground laboratory, the study site;
-
Name of station: it indicates the flow name (e.g. AJ, B, D) or the meteorological station called Meteo;
-
Date and time of measurement: it is indicated in UTC;
-
Name of parameter: as listed in the column “Parameters name” in Table 2;
-
Value;
-
Unit;
-
Name of measuring device;
-
Type of measuring device;
-
E-mail of suppliers.
Examples of Data Representation
3.4
These data can be synthesized by box plot to assess the variability of unsaturated zone flow parameters (Fig. 3). They can also be combined with other variables on chronicles, as shown in Fig. 4.Fig. 3. Examples of data displays: Box plots of (a) discharge and (b) electrical conductivity of the 12 unsaturated zone flows of the Fontaine de Vaucluse karst system.Fig 3. Fig. 4Representation of discharge and electrical conductivity of flows A, D, AJ and T with precipitations from the weather station.Fig 4
Experimental Design, Materials and Methods
4
Material Description
4.1
The 12 continuous monitoring stations installed at flows A–D, W, AJ, T, GAS, AC, AY, I and J are built in the same model (Fig. 5).Fig. 5. Monitoring station at flow D. Same stations are present at flows A, B and C; and same station without the field fluorimeter is installed at flows W, AJ, T, GAS, AC, AY, I, J. Fluorimeter data are not considered in this paper.Fig 5
The water that flows along the wall is collected by metal bars towards the funnel conducting water in a PVC pipe. A probe measuring electrical conductivity and water temperature is inside the PVC pipe below the funnel. The pipe shrinkage bellows the funnel supports the probe and allows for the water storage necessary for measurement. Thanks to the pipes, water is transported either towards a fluorimeter (at flows A, B, C and D, data not considered in this paper) or directly to the flowmeter which consists of a plexiglass tube with a pressure sensor at the bottom. When the water within flowmeter reaches a defined level, a sensor commands the opening at the bottom of the tube and close the opening when the tube is empty.
LSBB galleries are also equipped with sensors measuring air pressure, air temperature and humidity at points W, D, AJ, J and I (see location in Fig. 1). At the surface above flow D, a weather station monitors air pressure, air temperature, pluviometry, solar radiation, wind speed and wind direction.
Measurements of all parameters are taken with a time step of 1 s and instrument type is provided in Table 3. Operator intervention for sampling at a weekly to monthly interval requires the disassembling of the station for easy water sampling. During each intervention, operator also injects Hydrochloric acid (around 100 mL) to keep the station clean from calcite precipitation.Table 3. Description of the measuring tools.Table 3. Measuring toolFabricantSensor referenceParametersWeather stationGill Instruments Limited Saltmarsh Park, 67 Gosport Street, Lymington, Hampshire, SO419EG, UKMetPak Pro with Integrated WindSonicAir pressure (mbar), air temperature (°C), pluviometry (mm), wind speed (m.s^−1^) and wind direction (°), solar radiation (W.m^−2^)FlowmeterSICK AG Sensor Intelligence, 21, boulevard de Beaubourg, ZI Paris Est - BP 42, 77184 ÉmerainvillePFT-SRBX10SG1SSAAMSSZPressure (mA)Water probeAQUALABO, 90 Rue du Professeur Paul Milliez, 94506 Champigny-sur-MarneSensor C4EElectrical conductivity (µS/cm) and water temperature (°C)Air probeYoctopuce Sarl, 33, route de Cartigny, 1236 Cartigny, SUISSEYocto-MeteoAir pressure (mbar), air temperature (°C) and humidity (%)
Discharge Calculation
4.2
The flowmeter consists of a plexiglass tube in which the water is collected. It has a pressure sensor at the bottom that measures the pressure inside the tube. When the water level reaches a defined level, the draining system commands the opening of the plexiglass tube at its base to drain the water. When the tube is empty, it automatically closes to allow for refilling. In the raw data, the flowmeter provides two kind of information at 1 s intervals: the pressure inside the plexiglass tube (value in mA) and whether the tube is open (value 0 or 1).
To convert these variables into discharge data, we first estimated the volume of water inside the plexiglass tube using a linear model (ax+b) between volume and pressure. This linear model was obtained by direct experimentation on the stations within the LSBB galleries, by adding known and increasingly larger volumes (from 200 to 500 mL) of water in the plexiglass tubes (3 times at least for each flowmeter) while retrieving the corresponding pressure value in mA given by sensors (r² = 09968 to 1). The data are then averaged to 10 s time step to remove the noise associated with measurements uncertainties.
The time series representing the volume over time shows the peaks and lows caused by the opening and closing of the plexiglass tube (Fig. 6a and b). The discharge is obtained by (i) first identifying all the extreme values of tube filling/emptying, resulting in two values (min, max) for each cycle (the lows corresponding to the plexiglass tube that has been emptied and the highs to the last values before emptying); (ii) then calculating the derivative of volume with respect to time for each cycle, which is deemed appropriate as we aim for a final hourly time step and the cycles are always shorter than 30 min. The occasional consecutive values on the same extreme (min or max) are removed as they are artefacts of sampling uncertainty. The resulting discharge value is assigned to the median time between the highest and lowest extremes (Fig. 3c). Finally, the discharge is converted to m^3^.s^−1^.Fig. 6. Transformation of raw data from the flowmeter (a and b) to discharge (c). Data is from flow A, September 2022.Fig 6
Data Processing and Validation
4.3
Using R software, the data have been set to an hourly time step as it is more suitable for pre-processing discharge data because of the closing and opening delay of the plexiglass tube. Data validation is performed by the following steps:
- (i)Removal of erroneous values, i.e. extremes that are the result of a probe dysfunction or an operator intervention, e.g. for discharge it consists in removing values lower than 0.05 L.h^−1^ and greater than 500 L.h^−1^; for electrical conductivity lower than 100 µS.cm^−1^ and greater than 1200 µS.cm^−1^. For the precipitation, any negative values were set to 0.
- (ii)Removal of isolated outliers on the discharge time series using a Hampel filter, which identifies outliers based on deviation from the median within a sliding window [9].
- (iii)Control of the accuracy of the continuous measured values with respect to the point-in-time monitoring (measured at least once a month with calibrated instruments). This helps to identify potential sensor measurement drift and enables appropriate correction when necessary.
- (iv)Removal of more subtle outliers on the discharge, electrical conductivity and temperature time series caused by the regular operator intervention for sampling water and cleaning the monitoring station. Those can be easily identified since the date and time of each intervention are well-documented.
- (v)Removal of values when the flow is inactive (only A, B, C and D are permanent, the other 8 are temporary). For this purpose, both electrical conductivity and water temperature values are set to Non Available (NA) when the discharge is zero.
Limitations
The first limitation to mention is that for temporary flows (all flows except A, B, C and D), data gaps due to temporary technical problems can be interpreted as flow inactivity. However, flow inactivity is less random than technical problems as it is related to environmental conditions such as rain.
Second, discharge is used as an indicator of inactive flow in order to remove electrical conductivity and temperature data while the flow is dry. However, some problems with opening or closing the Plexiglas tube of the flowmeter (e.g. calcite buildup) may lead to the misinterpretation of the flow activity, and therefore to the deletion of accurate temperature and electrical conductivity data.
Ethics Statement
Authors have read and follow the ethical requirements for publication in Data in Brief. This work meets these requirements and authors confirm that the current work does not involve human subjects, animal experiments, or any data collected from social media platforms.
CRediT authorship contribution statement
Leïla Serène: Writing – original draft, Conceptualization. Guillaume Cinkus: Writing – review & editing, Visualization, Formal analysis, Conceptualization. Naomi Mazzilli: Writing – review & editing, Supervision, Methodology, Conceptualization. Jean-Baptiste Decitre: Methodology. Christophe Emblanch: Conceptualization, Methodology. Franck Tison: Methodology. Julien Dupont: Methodology. Milanka Babic: Methodology. Roland Simler: Methodology. Matthieu Blanc: Methodology.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mazzilli N.high-frequency monitoring of discharge, electrical conductivity and temperature of flows from the low noise underground laboratory (LSBB - unsaturated zone of the Fontaine-de-Vaucluse karst system)SNO H+ OSURIS data repository 202410.26169/HPLUS.LSBB_WATER_HIGH-FREQUENCY_MONITORING
- 2Mazzilli N.ambiant air data inside and atmospheric data outside LSBB artificial gallery (unsaturated zone of the Fontaine-de-Vaucluse karst system)SNO H+ OSURIS data repository 202410.26169/HPLUS.LSBB_AIR_WEATHER_HIGH-FREQUENCY_MONITORING · doi ↗
- 3Emblanch C.Zuppi G.M.Mudry J.Blavoux B.Batiot C.Carbon 13 of TDIC to quantify the role of the unsaturated zone: the example of the Vaucluse karst systems (Southeastern France)J. Hydrol.2791–4200326227410.1016/S 0022-1694(03)00180-X · doi ↗
- 4Ollivier C.Karst recharge-discharge semi distributed model to assess spatial variability of flows Sci. Total Environ.703202013436810.1016/j.scitotenv.2019.13436831731168 · doi ↗ · pubmed ↗
- 5KARST SNO Time series of type physico-chemistry in Fontaine de Vaucluse basin - FONTAINE DE VAUCLUSE observatory - KARST observatory network - OZCAR critical zone network research infrastructure OSU OREME data repository 202410.15148/FB 67E 0B 8-4B 19-4C 46-875B-0E 2D 44EA 485F
- 6KARST SNO Time series of type chemistry in Fontaine de Vaucluse basin - FONTAINE DE VAUCLUSE observatory - KARST observatory network - OZCAR Critical Zone network Research Infrastructure OSU OREME data repository 202410.15148/2A 71AB 59-CBC 8-4B 0F-9A 00-2CA 7DA 7BAF 3D
- 7Ollivier C.Caractérisation et spatialisation de la recharge des hydrosystèmes karstiques: application à l’aquifère de Fontaine de Vaucluse, France 2020 Université d’Avignonhttps://theses.hal.science/tel-02614260
- 8Barbel-Périneau A.Caractérisation du fonctionnement de la zone non saturée des aquifères karstiques: approche directe par études hydrodynamiques et hydrochimiques sur le Bassin de Recherche, d’Expérimentation et d’Observation de Fontaine de Vaucluse – Laboratoire Souterrain à Bas Bruit de Rustrel – Pays d’Apt 2013 Université d’Avignon et des Pays de Vauclusehttps://theses.hal.science/tel-00990290 v 1
