ECG Contrastive + Linear Probe Classifier

1) Overview

This project was developed while participating in PhysioNet Challenge 2025{https://moody-challenge.physionet.org/2025/}

It takes 12-lead ECG signals as input and follows a two-stage pipeline:

Contrastive Pretraining (self-supervised learning)
Linear Probe Supervised Fine-tuning

to perform binary classification.

Input length: variable length → unified by padding/clipping (4096)
Sampling rate: resampled to 400 Hz
Core model: ResNet1D + SE Block + Transformer Block encoder, trained contrastively and then fine-tuned with a linear probe classifier

All hyperparameters are decided empirically

2) File Descriptions

The repository contains both official baseline scripts (provided by the challenge organizers) and custom modules:

train_model.py → Baseline training entry script (provided)
run_model.py → Baseline inference entry script (provided)
evaluate_model.py → Evaluation script for computing challenge metrics (provided)
helper_code.py → Utility functions for data I/O and ECG header/signal parsing (provided)
team_code.py → Main file containing custom training pipeline (train_model function) and inference pipeline (run_model function)
util_nh.py → Core custom implementation: preprocessing (SignalProcessor), model architectures (ResNet1D, SEBlock1D, TransformerBlock1D), augmentation, dataset classes, and training helpers
requirements.txt → Python dependencies for reproducibility

3) Preprocessing

`SignalProcessor`

Resample (→ 400 Hz)
Replace NaNs with finite values
Bandpass filter: 0.5–45 Hz (4th-order Butterworth)
Notch filter: 60 Hz powerline noise removal

Normalization

def normalize_leads(arr):

    (arr - mean) / std  # per-lead normalization

Padding

Pad all samples to shape (max_len, num_leads)

4) Data Augmentation

`augment_signal_v2`

Add Gaussian noise
Random scaling (0.9 ~ 1.1)
Time masking (mask 10% of the signal with zeros)
Random temporal shift (±5 samples)

Used to generate two augmented views (v1, v2) for contrastive learning.

def augment_signal_v2(x):
    x = x + torch.randn_like(x) * 0.025
    scale = torch.empty(1).uniform_(0.9, 1.1).to(x.device)
    x = x * scale
    t = x.size(1)
    mask_len = int(t * 0.1)
    start = torch.randint(0, t - mask_len, (1,)).item()
    x[:, start:start + mask_len] = 0
    shift = torch.randint(-5, 6, (1,)).item()
    if shift > 0:
        pad = torch.zeros(x.size(0), shift, device=x.device)
        x = torch.cat([x[:, shift:], pad], dim=1)
    elif shift < 0:
        pad = torch.zeros(x.size(0), -shift, device=x.device)
        x = torch.cat([pad, x[:, :shift]], dim=1)
    return x

5) Dataset Classes

ContrastiveECGDataset: Takes one ECG signal and applies augment_signal_v2 twice → returns (v1, v2).
SupervisedECGDataset: Returns (signal, label) pairs → used for linear probe training.

6) Model Architecture

(1) SEBlock1D

Squeeze-and-Excitation module (channel-wise attention)
Global Average Pooling → Conv1d(reduction) → ReLU → Conv1d → Sigmoid

(2) TransformerBlock1D

Multihead Self-Attention (nn.MultiheadAttention)
LayerNorm + residual connections
Position-wise MLP (Linear → GELU → Linear)

(3) BasicBlock1D

ResNet-style 1D convolutional block
Conv-BN-ReLU → Conv-BN → SEBlock → Residual connection

(4) ResNet1DEncoder

Stem: Conv(7x1, stride=2) + BN + ReLU + MaxPool
Layer1–4: ResNet blocks (64 → 128 → 256 → 512)
Global Avg Pooling → TransformerBlock → Projection Head

Outputs:

x: encoder feature (512-d)
z: projection head embedding (contrastive learning, normalized)

7) Contrastive Loss: NT-Xent

Input: z1, z2 (augmentation pair)
Positive pairs: different views of the same sample
Negative pairs: all other samples in the batch
Softmax + CrossEntropy-based loss

8) Linear Probe Classifier

`LinearProbeHead`

2-layer MLP:
- Input: encoder feature (512-d)
- Hidden: 128-d
- Output: num_classes (default=2)

`LinearProbeTrainer`

Encoder is frozen; only the linear head is trained
Optimizer: Adam (lr=1e-4)
Loss: Weighted CrossEntropy
- class_weights = [0.5, 1.0] (to address class imbalance)
Model saving: torch.save(self.head.state_dict(), save_path)

9) Training Flow

Phase 1: Contrastive Pretraining

Apply augmentations to ECG signal → create (v1, v2) pairs
Pass through encoder + projection head
Train using NT-Xent loss

Phase 2: Linear Probe Fine-tuning

Freeze encoder
Load labeled ECG data via SupervisedECGDataset
Train only the linear probe head with CrossEntropyLoss

10) Key Design Choices

ResNet + SEBlock + Transformer: combines local pattern extraction, channel attention, and global dependencies
Contrastive pretraining improves generalization on limited/noisy labels
Linear probe allows efficient use of labels
Design considers ECG-specific challenges (noise, variability, varying sequence lengths)

11) Minimal Example

# Pretrain contrastive
dataset = ContrastiveECGDataset(signals)
loader = DataLoader(dataset, batch_size=64, shuffle=True)
encoder = ResNet1DEncoder(in_ch=12)

# Fine-tune with Linear Probe
trainer = LinearProbeTrainer(
    encoder, signals_tensor, labels_array, save_path="linear_probe.pt",
    batch_size=64, lr=1e-4, device="cuda"
)
trainer.train(epochs=5)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ECG Contrastive + Linear Probe Classifier

1) Overview

2) File Descriptions

3) Preprocessing

`SignalProcessor`

Normalization

Padding

4) Data Augmentation

`augment_signal_v2`

5) Dataset Classes

6) Model Architecture

(1) SEBlock1D

(2) TransformerBlock1D

(3) BasicBlock1D

(4) ResNet1DEncoder

7) Contrastive Loss: NT-Xent

8) Linear Probe Classifier

`LinearProbeHead`

`LinearProbeTrainer`

9) Training Flow

Phase 1: Contrastive Pretraining

Phase 2: Linear Probe Fine-tuning

10) Key Design Choices

11) Minimal Example

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
ml_experiments		ml_experiments
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
evaluate_model.py		evaluate_model.py
helper_code.py		helper_code.py
requirements.txt		requirements.txt
run_model.py		run_model.py
team_code.py		team_code.py
train_model.py		train_model.py
util_nh.py		util_nh.py

Folders and files

Latest commit

History

Repository files navigation

ECG Contrastive + Linear Probe Classifier

1) Overview

2) File Descriptions

3) Preprocessing

SignalProcessor

Normalization

Padding

4) Data Augmentation

augment_signal_v2

5) Dataset Classes

6) Model Architecture

(1) SEBlock1D

(2) TransformerBlock1D

(3) BasicBlock1D

(4) ResNet1DEncoder

7) Contrastive Loss: NT-Xent

8) Linear Probe Classifier

LinearProbeHead

LinearProbeTrainer

9) Training Flow

Phase 1: Contrastive Pretraining

Phase 2: Linear Probe Fine-tuning

10) Key Design Choices

11) Minimal Example

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`SignalProcessor`

`augment_signal_v2`

`LinearProbeHead`

`LinearProbeTrainer`

Packages