LCSB R³
Responsible and Reproducible Research

Non-Target Screening of Surface Water Samples to Identify Exposome-Related Pollutants: A Case Study from Luxembourg#

Authors#

Dagny Aurich, Phillipe Diderich, Rick Helmus, Emma Schymanski

Abstract#

Background: Non-target screening of surface water samples collected over an extended period can reveal interesting temporal patterns in exposome-related pollutants. Additionally, geographical data on pollution sources in proximity to the sampling sites, chemical classification data and the consideration of flow paths can provide valuable information on the origins and potential threat of tentatively identified chemical compounds. In this study, 271 surface water samples from 20 sampling sites across Luxembourg were analysed using high resolution mass spectrometry, complementing routine target monitoring efforts in 2019-2022. Data analysis was performed using the open-source R-package patRoon, which offers a customizable non-target workflow. By employing open-source workflows featuring scoring terms, like the individual MoNA score and applying identification levels, tentative identifications can be prioritized, e.g. based on spectral similarity. Furthermore, by utilizing supplementary database information such as PubChemLite categories and classification software such as classyFire, an overall assessment of the potential threats posed by the tentatively identified chemicals was conducted, enabling the prioritization of chemicals for future confirmation through targeted approaches.

Results: The study tentatively identified 375 compounds associated with the exposome including benzenoids, organoheterocyclic compounds, and organic phosphoric acids and derivatives (12 classyFire superclasses, 50 sub-classes). The classification analysis not only revealed temporal variations in agrochemicals, with the majority of identifications occurring in May to July, but also highlighted the prevalence of pharmaceuticals such as venlafaxine in surface waters. Furthermore, the study examined potential sources of pollutants, like metallurgic industry or household products by considering common uses and geographical information, as commercial uses of almost 100% of the identified chemicals are known. 42 chemicals were suggested for potential inclusion to governmental monitoring lists.

Conclusions: The findings of this study complement existing knowledge on the pollution status of surface water in Luxembourg and highlight the usefulness of non-target screening for identifying temporal and spatial trends in pollutant levels. This approach, performed in a complementary manner to routine monitoring, can help to tentatively identify chemicals of concern for inclusion in target monitoring methods following additional confirmation and quantification efforts.

Data, code, figures and supplementary files#

All data, code, figures and supplementary files can be found in the ‘data_luxwater_nt_paper_da’ repository of the ECI group in GitLab, via https://gitlab.lcsb.uni.lu/eci/data_luxwater_nt_paper_da under license Artistic 2.0. The required sample identifiers to link the measurement to location and date can be found in this GitLab file.

mZML data#

The measurement files in mzML format can be found as dataset MSV000092221 from the GNPS MassIVE repository, accessible via ftp://massive.ucsd.edu/MSV000092221/ and to be cited with DOI: 10.25345/C55X25P62.