Published June 30, 2026 | Version v1
Model Open

Bitumen Refractive Index and ATR-FTIR baseline correction (script)

Description

ATR-FTIR Baseline Correction Pipeline for Bituminous Materials

Overview

This repository contains a complete processing pipeline for ATR-FTIR spectra of bituminous materials, with a focus on:

  • Physics-based ATR correction
  • Baseline correction
  • Spectral normalization
  • Quantitative peak integration
  • Comparative visualization

The correction approach is based on refractive index–dependent ATR penetration depth and characteristic aliphatic band vibrations, allowing improved comparison between spectra and more physically meaningful interpretation of band areas.

The pipeline is designed to process raw FTIR spectra in a predefined CSV format and automatically generate corrected datasets and publication-ready plots.

Scientific Background

ATR-FTIR measurements of bituminous materials are strongly influenced by:

  • Wavenumber-dependent penetration depth
  • Refractive index mismatch
  • Baseline distortions
  • Band intensity scaling effects

This script applies:

  1. ATR correction based on refractive index assumptions
  2. Baseline correction
  3. Spectral normalization
  4. Quantitative integration of key functional group regions

Special attention is given to aliphatic CH stretching and rocking vibrations, which are used as internal references in the correction workflow.

Pipeline Structure

The main pipeline performs the following steps:

1. Extraction and Normalization

  • Reads raw CSV spectra
  • Averages replicates (if present)
  • Stores extracted spectra
  • Generates normalized spectra

2. ATR + Baseline Correction

  • Applies ATR correction based on penetration depth
  • Applies baseline correction
  • Generates corrected and normalized corrected spectra

3. Spectral Parameter Extraction

Integrates characteristic peak regions:

Functional GroupWavenumber Range (cm⁻¹)
Aliphatic CH stretching2800–3000
Carbonyl (C=O)1650–1750
Aromatic C=C1520–1650
Sulfoxides (S=O)980–1070
Aromatic bending735–910
Aliphatic CH rocking710–735  

Areas are extracted for:

  • Raw spectra
  • Raw ATR-corrected spectra
  • Normalized raw spectra
  • Normalized ATR-corrected spectra

4. Visualization

The script automatically generates:

  • Raw vs corrected spectra plots
  • Difference spectra
  • Aliphatic rocking vs stretching comparison plots
  • Bar plots of integrated peak areas (dual y-axis comparison)
  • Fitted ATR parameter visualization

All figures are saved to disk for further analysis.

Input Requirements

Raw Data Format

The script expects raw spectra as CSV files with:

  • Column containing wavenumber (cm⁻¹)
  • Column(s) containing absorbance values
  • Wavenumber axis in descending or ascending order (consistent format)
  • Uniform spacing across spectra

Improper formatting will prevent correct parsing.

It is recommended that:

  • All spectra share the same spectral range
  • Units are in cm⁻¹ and absorbance

Output

The pipeline generates:

  • Extracted spectra JSON files
  • Normalized spectra JSON files
  • ATR-corrected spectra JSON files
  • Integrated spectral parameter JSON file
  • PNG figures for:
    • Spectra comparison
    • Difference spectra
    • Peak area barplots
    • ATR fitting diagnostics

Goal of the Project

The purpose of this project is to provide a reproducible and physically informed correction workflow for ATR-FTIR spectra of bituminous materials, enabling:

  • Reliable comparison between samples
  • Reduction of ATR-induced distortions
  • Quantitative analysis of ageing and oxidation markers
  • Improved interpretation of aliphatic band behaviour

The pipeline is intended for research applications in:

  • Bitumen ageing studies
  • Oxidation monitoring
  • Chemical fingerprint analysis
  • Material comparison studies

Running the Pipeline

python main.py --raw data/raw --extracted data/extracted --corrected data/corrected --plots data/spectra_plots

Default directories are provided but can be modified via CLI arguments.

Notes

  • The ATR correction relies on assumed refractive index behaviour.
  • Results should be interpreted within the limits of the physical model.
  • The script is designed specifically for bituminous materials and may require adaptation for other material systems.
  • The zipped folder "data.zip", should be extracted before the script is run and it contains a small series of exemplary data

Other (English)

The GNU General Public License holds for the python script. The Creative Commons Attibution is here for the dataset provided with exemplary data for the model.

Files

data.zip

Files (2.2 MiB)

NameSize
md5:62184f620aaf2954057e4c1c0a78cefa
6.3 KiBDownload
md5:04ff1449de614530b341af44f7061175
2.2 MiBPreview Download
md5:2ef139c536250ce75a4f162c3bd293b6
3.8 KiBDownload
md5:203eb30546f47fd6f39d2aaeba8c7552
4.6 KiBDownload
md5:de0740127dac549b4a19ee2bcca9080c
4.9 KiBDownload
md5:75c57489d521cace7d4f66cbe278497b
3.9 KiBDownload
md5:209c49d4fc587edb9a9d4e00ea6de760
9.4 KiBDownload
md5:426597accb30698232e6f3777eb769b2
4.4 KiBPreview Download
md5:7fc1e0156f8b835be0851b4c6c75c364
54 BytesPreview Download
md5:e06f1aa86e393886e1f8fe10c59134a6
6.0 KiBDownload

Additional details

Related works

Is supplement to
Dataset: 10.48436/10avt-mr027 (DOI)