Add fine-tuned SuperAnimal-Quadruped support and improve demo setup by xiu-cs · Pull Request #30 · AdaptiveMotorControlLab/FMPose3D

xiu-cs · 2026-05-17T06:55:29Z

Summary

This PR adds first-class support for the fine-tuned SuperAnimal-Quadruped 2D pose model used by the animal pipeline, enabling direct 26-joint Animal3D keypoint prediction and automatic checkpoint download from Hugging Face. It also improves the out-of-the-box demo/install path, fixes CPU fallback for the human HRNet demo, and removes unused legacy code/assets.

Changes

Add support for a fine-tuned SuperAnimal-Quadruped 2D checkpoint that predicts the 26-joint Animal3D layout directly.
Auto-download animal demo checkpoints from Hugging Face on first run:
- sa_finetune_hrnet_w32.pt for 2D animal pose
- fmpose3d_animals.pth for the 3D lifter
Refactor animals/demo/vis_animals.py to build the 2D estimator and 3D lifter once, then reuse them across images.
Add SuperAnimalConfig options for fine-tuned checkpoints, detector overrides, and lazy Hugging Face resolution.
Update animal defaults and docs from older Rat7M/legacy assumptions toward Animal3D.
Fix human HRNet loading on CPU-only environments by using device-aware map_location and moving inputs to the model device.
Pin install dependencies to torch>=2.4.1,<2.5 and torchvision>=0.19.1,<0.20, and document the PyTorch/CUDA behavior in the README.
Restrict package Python metadata to >=3.10,<3.13; README recommends Python 3.10 because install/demo paths were tested there.
Remove unused legacy animal modules and unused YOLO/HRNet assets.
Add/update tests for the fine-tuned SuperAnimal path and config behavior.
Add mot to the codespell ignore list.

Validation

Ran install, test, and demo checks locally:

python3 -m pip install -e '.[animals,viz]' --dry-run
python3 -m pytest tests/test_demo_human.py tests/fmpose3d_api/test_fmpose3d.py -q
python3 -m pytest tests/test_model.py tests/test_training_pipeline.py -q
bash demo/vis_in_the_wild.sh
bash animals/demo/vis_animals.sh

Results

Human demo passes on both CPU-only and GPU paths.
Animal demo passes and auto-resolves both Hugging Face checkpoints.
Relevant tests pass: 78 passed for human demo/API tests, 8 passed for model/training smoke tests.

…ine-tuning

…descriptions

…model_path

…ator and streamline debug handling

…rity and parameter naming

… process

…auto-download functionality

…e-tuned and stock modes

…ading

- Deleted `graph_utils.py`, which contained functions for adjacency matrix creation and normalization. - Removed `lifter3d.py`, which included keypoint processing, 3D triangulation, and visualization functions. - Eliminated `mocap_dataset.py`, which defined the `MocapDataset` class for handling motion capture data.

… root path accordingly

…entation

…3D compatibility

…uperAnimalEstimator

… and reuse across images, improving efficiency and clarity.

…lity

…handling

…n pyproject.toml

…h/CUDA installation notes

… main_animal3d.py

…local weights

deruyter92

Great PR which definitely improves the package. I really like the addition of the fine-tuned SuperAnimal 2D model!

A few remarks:

small bug in partial cleanup for rat7m
the lazy downloading from hugginface is not working as I think you intended it
the predict() method should be cleaned a bit
it would be great if you add tests for the new auto-download branch

Overall good PR! See comments

deruyter92 · 2026-05-22T08:33:22Z

+def build_2d_estimator():
+    """Build the 2D pose estimator once. Snapshot resolves lazily on first predict.
+
+    Empty --saved_2d_model_path -> auto-download fine-tuned snapshot from HF.
+    Non-empty path -> use as a local override.
+    """
+    from fmpose3d.common.config import SuperAnimalConfig
+    from fmpose3d.inference_api.fmpose3d import SuperAnimalEstimator
+    from fmpose3d.utils.weights import resolve_weights_path
+


Well done refactoring this: way cleaner, and also more efficient! Few comments:

The docstring seems to contain an error: the statement "snapshot resolves lazily on first predict" is not correct, since it is resolved immediately.

The resolve_weights_path seems to download from HF directly with an empty path, which seems to be inconsistent with the approach elsewhere (letting it trigger by the predict method)

Minor nitpick: I think the imports in this case can stay on the top of the file. I would lazily import only for heavy packages (like deeplabcut) or modules that are super specific for a single function. These are all lightweight central helpers, so might belong on the top of the file instead.

Suggested change

def build_2d_estimator():

"""Build the 2D pose estimator once. Snapshot resolves lazily on first predict.

Empty --saved_2d_model_path -> auto-download fine-tuned snapshot from HF.

Non-empty path -> use as a local override.

"""

from fmpose3d.common.config import SuperAnimalConfig

from fmpose3d.inference_api.fmpose3d import SuperAnimalEstimator

from fmpose3d.utils.weights import resolve_weights_path

def build_2d_estimator():

"""Build the 2D pose estimator once.

Empty --saved_2d_model_path -> auto-download fine-tuned snapshot from HF.

Non-empty path -> use as a local override.

"""

deruyter92 · 2026-05-22T08:42:15Z

-    print(f"    - Left hind leg: {graph_rat.left_hind}")
-    print(f"    - Right hind leg: {graph_rat.right_hind}")
-    print(f"    - Spine: {graph_rat.spine}")
    print(f"  Distance to center (joint 4): {graph_rat.dist_center}")


I think this one was forgotten in the removal of the Rat7M code..

Suggested change

print(f" Distance to center (joint 4): {graph_rat.dist_center}")

deruyter92 · 2026-05-22T08:58:57Z

+        pose_snapshot_path = cfg.pose_snapshot_path
+        if not pose_snapshot_path and cfg.auto_download_finetuned:
+            from fmpose3d.utils.weights import resolve_weights_path
+            pose_snapshot_path = resolve_weights_path("", "sa_finetune_hrnet_w32.pt")


when auto-download is True and the path is not provided, resolve_weights_path is called on every predict call. (i.e. hf_hub_download checks the local cache on every call)

I think this could add up for videos with many frames. Instead, this should be resolved once (the first predict call)! e.g. you could define an attribute in __init__ that contains the downloaded weights path after the first download? or a simple flag.

deruyter92 · 2026-05-22T09:00:22Z

+        # Fine-tuned mode: non-empty resolved path swaps the stock 39-joint head
+        # for a custom DLC checkpoint that predicts the 26-joint Animal3D layout
+        # natively (no _map_keypoints needed).
+        is_finetuned = bool(pose_snapshot_path)


Same here, this can be resolved in __init__. (right now, all information is derived from a static config, which is available at initialization time)

deruyter92 · 2026-05-22T09:10:08Z



-def resolve_weights_path(model_weights_path: str, model_type: str) -> str:
+def resolve_weights_path(local_path: str, filename: str) -> str:


I think it's fine right now (since nobody is probably using this function right now), but we should be careful with renaming keyword arguments, as they can break peoples scripts.

i.e. this is not backward compatible for people who used to handle the weights in their own scripts:

from fmpose3d.utils import resolve_weights_path configured_path = "" my_weights_path = resolve_weights_path(model_weights_path=configured_path) # <- breaks now!

or more concerning:

from fmpose3d.utils import resolve_weights_path my_weights_path = resolve_weights_path(model_type="fmpose3d_humans") # <- breaks now!

TL;DR I think its fine for now, as you updated all the call sites internally, but be aware that people might use these public functions in their own scripts as well. We should try to keep all public functions backward compatible whenever possible.

In case this happens in the future, we could add a deprecation warning for cases that are more impactful than this minor change.

deruyter92 · 2026-05-22T09:12:21Z

+        # Default to fine-tuned + lazy HF auto-download so the animal API
+        # works out-of-the-box. Construction stays cheap (no network);
+        # the download fires on the first predict() call.
+        return (
+            SuperAnimalEstimator(SuperAnimalConfig(auto_download_finetuned=True)),
+            AnimalPostProcessor(),
+        )
    return HRNetEstimator(), HumanPostProcessor()


This seems to be inconsistent with how vis_animals.py resolves the path.

Here, is is allowed to be handled lazily in the predict() method.

In build_2d_estimator() the weights are downloaded directly and passed as pose_snapshot_path.

See my other comments in vis_animals.py. I think you intended the lazy handling in both, and I agree that it is probably better!

deruyter92 · 2026-05-22T09:14:44Z

+"""
+FMPose3D: monocular 3D Pose Estimation via Flow Matching
+
+Official implementation of the paper:
+"FMPose3D: monocular 3D Pose Estimation via Flow Matching"
+by Ti Wang, Xiaohang Yu, and Mackenzie Weygandt Mathis
+Licensed under Apache 2.0
+"""
+
+"""Bundled DLC ``pytorch_config.yaml`` files for the animal 2D detector.
+
+These yamls describe FMPose3D's fine-tuned SuperAnimal-Quadruped variants
+and are loaded by :class:`fmpose3d.inference_api.SuperAnimalEstimator` when
+the user does not supply an explicit ``pytorch_config_path``. They are
+shipped as package data (see ``pyproject.toml`` ``[tool.setuptools.package-data]``).
+"""


Suggested change

"""

FMPose3D: monocular 3D Pose Estimation via Flow Matching

Official implementation of the paper:

"FMPose3D: monocular 3D Pose Estimation via Flow Matching"

by Ti Wang, Xiaohang Yu, and Mackenzie Weygandt Mathis

Licensed under Apache 2.0

"""

"""Bundled DLC ``pytorch_config.yaml`` files for the animal 2D detector.

These yamls describe FMPose3D's fine-tuned SuperAnimal-Quadruped variants

and are loaded by :class:`fmpose3d.inference_api.SuperAnimalEstimator` when

the user does not supply an explicit ``pytorch_config_path``. They are

shipped as package data (see ``pyproject.toml`` ``[tool.setuptools.package-data]``).

"""

"""

FMPose3D: monocular 3D Pose Estimation via Flow Matching

Official implementation of the paper:

"FMPose3D: monocular 3D Pose Estimation via Flow Matching"

by Ti Wang, Xiaohang Yu, and Mackenzie Weygandt Mathis

Licensed under Apache 2.0

Bundled DLC ``pytorch_config.yaml`` files for the animal 2D detector.

These yamls describe FMPose3D's fine-tuned SuperAnimal-Quadruped variants

and are loaded by :class:`fmpose3d.inference_api.SuperAnimalEstimator` when

the user does not supply an explicit ``pytorch_config_path``. They are

shipped as package data (see ``pyproject.toml`` ``[tool.setuptools.package-data]``).

"""

Actually, I'm realizing that it would have probably been better to include the copyright header as comment (i.e. using #) instead of with a docstring. As the whole thing now appears when running help(), instead of only the module docstring.

deruyter92 · 2026-05-22T09:31:04Z

+             patch(
+                 "deeplabcut.pose_estimation_pytorch.apis.superanimal_analyze_images",
+             ) as mock_fn:
+            mock_fn.return_value = {"frame.png": {"bodyparts": fake_bp}}


is this working correctly? The code writes frames like "frame_000000.png" right?

xiu-cs added 28 commits May 15, 2026 21:33

Add arguments for 2D pose model overrides in opts class

d5d6136

Add configuration file for HRNet-w32 backbone fine-tuned on Animal3D

3aea87e

Add initial configuration file for animal 2D detector and HRNet-w32 f…

0d7d16f

…ine-tuning

Enhance SuperAnimalConfig with fine-tuning options and detailed mode …

cfe20da

…descriptions

Update vis_animals.sh to include saved_2d_model_path and reset saved_…

f66e63f

…model_path

Refactor 2D pose estimation in vis_animals.py to use SuperAnimalEstim…

59c960d

…ator and streamline debug handling

Refactor resolve_weights_path function in weights.py for improved cla…

06cbfc1

…rity and parameter naming

Update README.md to clarify pre-trained model usage and auto-download…

832c2d4

… process

Remove unused joint variables from vis_animals.sh

c439d22

Update model weights path resolution to include file extension

674afbe

Enhance SuperAnimalEstimator to support fine-tuned model loading and …

e4ec633

…auto-download functionality

Update README.md to enhance SuperAnimalEstimator description with fin…

090dcfc

…e-tuned and stock modes

Update model weights path resolution to include file extension for lo…

95293b4

…ading

Update DatasetConfig references from "rat7m" to "animal3d" and adjust…

ed3f084

… root path accordingly

Remove references to Rat7M dataset from Graph class and related docum…

bc4a886

…entation

Update action placeholder in main_animal3d.py for Animal3D dataset

f32ca40

Update dataset default value and root path in arguments.py for Animal…

54d1cb6

…3D compatibility

Add SuperAnimalConfig support and unit tests for fine-tuned mode in S…

ff801e5

…uperAnimalEstimator

Refactor get_pose2D function to streamline 2D pose estimation using S…

3279e88

…uperAnimalEstimator

Refactor 2D and 3D pose estimation functions to build estimators once…

4c20e0f

… and reuse across images, improving efficiency and clarity.

Add 'mot' to ignore words list in Codespell workflow

54ffab2

Refactor model loading to use device-agnostic code for CUDA compatibi…

e77e7f0

…lity

Refactor HRNetPose2d to use device-agnostic code for model and input …

6335c0b

…handling

Update Python version requirement and refine dependency constraints i…

4ebd20c

…n pyproject.toml

Remove unused import of coco_h36m from utilitys.py

e3fef16

Remove unnecessary configuration files and associated data

621b6a6

Update README.md to clarify Python version requirement and add PyTorc…

34a824f

…h/CUDA installation notes

xiu-cs changed the title ~~Ti dev~~ Add fine-tuned SuperAnimal-Quadruped support and improve demo setup May 17, 2026

xiu-cs requested a review from deruyter92 May 19, 2026 13:51

xiu-cs added 2 commits May 19, 2026 16:46

Fix checkpoint directory handling and improve weight loading logic in…

c6b1c4a

… main_animal3d.py

Refactor model path handling in test_animal3d.sh to clarify usage of …

6131b5e

…local weights

deruyter92 reviewed May 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fine-tuned SuperAnimal-Quadruped support and improve demo setup#30

Add fine-tuned SuperAnimal-Quadruped support and improve demo setup#30
xiu-cs wants to merge 30 commits into
mainfrom
ti_dev

xiu-cs commented May 17, 2026 •

edited

Loading

Uh oh!

deruyter92 left a comment

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

deruyter92 May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants



		def resolve_weights_path(model_weights_path: str, model_type: str) -> str:
		def resolve_weights_path(local_path: str, filename: str) -> str:

Conversation

xiu-cs commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Validation

Results

Uh oh!

deruyter92 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

xiu-cs commented May 17, 2026 •

edited

Loading