Skip to content

nachiket273/tbg-stm-ftldos-hubbard-U-regression

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DOI

Learning Interaction-Dependent Features from STM FT-LDOS Images using Machine Learning

This repository contains the code accompanying the research work:

Prediction of Hubbard U parameter from Moiré System STM images using image recognition

This project investigates whether interaction-dependent features encoded in simulated scanning tunneling microscopy (STM) Fourier-transformed local density of states (FT-LDOS) images can be learned using convolutional neural networks (CNNs).

The implemented framework provides dataset generation, training, evaluation, and analysis pipelines for supervised regression of interaction-dependent electronic structure features from FT-LDOS images.


Scientific Motivation

Moiré systems, such as twisted bilayer graphene, exhibit strongly correlated electronic phases arising from electron–electron interactions. The Hubbard interaction strength (U) plays a central role in determining these phases.

Scanning tunneling microscopy (STM) provides spatially resolved measurements of the local density of states (LDOS), and its Fourier transform (FT-LDOS) reveals momentum-space information including quasiparticle interference and correlation-induced spectral features.

This work investigates whether machine learning models can learn interaction-dependent patterns in FT-LDOS images generated from theoretical simulations.

The goal is not to directly infer experimental Hubbard parameters, but to quantify and characterize interaction-dependent features encoded in STM-derived observables within a controlled theoretical framework.


Repository Structure

src/
| - common/
 | - helper.py # Utility functions: random seeding, dataset splitting, model saving/loading.
| - fourier_transform/
 | - stm_ft.py # FT-LDOS dataset generation from simulated LDOS data.
| - regression/
 | - model_regress.py # Neural network architectures: Custom CNN, Modified ResNet-18 regression model.
 | - run_regress.py # Training pipeline
 | - test_regress.py # Evaluation pipeline
 | - visualize_regress.py # Visualization and analysis tools
 | - grad_cam_regress.py # Grad-CAM based model interpretability
| - config/
 | - config.ini # Hyperparameter configuration


Method Overview

The pipeline consists of four stages:

  1. Generate simulated FT-LDOS dataset
  2. Train neural network regression models
  3. Evaluate model performance on unseen data
  4. Analyze learned representations and feature importance

The models learn a mapping:
FT-LDOS image → interaction-dependent electronic structure features


Installation

Create Python environment:

conda create -n stm_ml python=3.10
conda activate stm_ml

Install dependencies:

pip install torch torchvision numpy scipy scikit-learn matplotlib tqdm
pip install numba
pip install grad-cam

Dataset Generation

Generate FT-LDOS images:

python fourier_transform/stm_ft.py --data_dir [DATA_DIR] --images_dir [IMAGES_DIR]

where:
--data_dir : Base directory containing LDOS data.
--images_dir : Target directory to save FT-LDOS images.

This produces:

  • Fourier-transformed LDOS images
  • Labels corresponding to interaction-dependent simulation parameters

Training

Train regression model:

python regression/stm_main_regress.py --data_dir [DATA_DIR] --model_type [MODEL_TYPE] --model_path [MODEL_PATH]

where:
--data_dir : Directory containing FT-LDOS images.
--model_type : model type - fc, conv or pretrained.
--model_path : Path to save trained model weights. \

Features:

  • Stratified dataset splitting
  • Fixed random seeds
  • Early stopping
  • Model checkpoint saving

Supported architectures:

  • Custom CNN
  • ResNet-18 (modified for regression)

Evaluation

Evaluate trained model:

python regression/test_regress.py --data_dir [DATA_DIR] --model_type [MODEL_TYPE] --model_path [MODEL_PATH]

where:
--data_dir : Directory containing test set FT-LDOS images.
--model_type : model type - fc, conv or pretrained.
--model_path : Path to load trained model weights. \

Evaluation metrics:

  • Mean Absolute Error (MAE)
  • Root Mean Squared Error (RMSE)
  • Coefficient of determination (R²)

Interpretability

Guided Backpropagation: Provides insight into spatial regions contributing to gradient for backpropaation.

python regression/visualize_regress.py --data_path [DATA_PATH] --save_path [SAVE_PATH] --model_type [MODEL_TYPE] --model_path [MODEL_PATH]

where:
--data_path : Directory containing FT-LDOS images.
--save_path : Directory to save guided basckpropagation mask and images.
--model_type : model type - fc, conv or pretrained.
--model_path : Path to load trained model weights. \

Grad-CAM visualization: Provides insight into spatial regions contributing to model predictions.

python regression/grad_cam_regress.py --data_path [DATA_PATH] --save_path [SAVE_PATH] --model_type [MODEL_TYPE] --model_path [MODEL_PATH]

where:
--data_path : Directory containing FT-LDOS images.
--save_path : Directory to save grad-cam masked images.
--model_type : model type - fc, conv or pretrained.
--model_path : Path to load trained model weights. \


Configuration

Hyperparameters are defined in:

src/config/config.ini

including:

  • Number of k-points
  • Initial Learning Rate
  • Number of Epochs
  • Batch Size
  • Dropout Rate
  • Learning Rate Scheduler
  • Early Stoppin Patience
  • Random Seed
  • and Others...

Intended Use

This repository is intended for:

  • Computational condensed matter physics research
  • Machine learning analysis of simulated STM data
  • Methodological studies of feature extraction from electronic structure simulations

This repository is not intended for direct experimental parameter inference without further validation


Citation

@misc{tanksale2026predictionatomistichubbardu,
      title={Prediction of the atomistic Hubbard U interaction from moir\'e system STM-images using image recognition}, 
      author={Nachiket Tanksale and Tobias Stauber},
      year={2026},
      eprint={2602.18890},
      archivePrefix={arXiv},
      primaryClass={cond-mat.mes-hall},
      url={https://arxiv.org/abs/2602.18890}, 
}

About

Machine learning framework for inferring interaction-dependent features from simulated STM-derived Fourier-transformed local density of states (FT-LDOS) images in TBG moiré systems.

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors