feat: update animals model (#470)

* chore: update the animals model version

* chore: remove duplicate line

* chore: update readme

* chore: update readme

* chore: update changelog

* chore: update changelog

* chore: update changelog

* chore: update changelog

* chore: update changelog
Jianzhu Guo authored 2025-01-01 18:10:01 +08:00, committed by GitHub
commit 1df473f067 (parent 632da7486d)
9 changed files with 44 additions and 11 deletions

@@ -96,6 +96,7 @@ with gr.Blocks(theme=gr.themes.Soft(font=[gr.themes.GoogleFont("Plus Jakarta San
 [osp.join(example_portrait_dir, "s30.jpg")],
 [osp.join(example_portrait_dir, "s31.jpg")],
 [osp.join(example_portrait_dir, "s32.jpg")],
+[osp.join(example_portrait_dir, "s33.jpg")],
 [osp.join(example_portrait_dir, "s39.jpg")],
 [osp.join(example_portrait_dir, "s40.jpg")],
 [osp.join(example_portrait_dir, "s41.jpg")],

@@ -0,0 +1,29 @@
## 2025/01/01
**We're thrilled that cats 🐱 are now speaking and singing across the internet!** 🎶
In this update, we've improved the [Animals model](https://huggingface.co/KwaiVGI/LivePortrait/tree/main/liveportrait_animals/base_models_v1.1) with more data. While you might notice only a slight improvement for cats (if at all 😼), dogs have received a somewhat more noticeable upgrade. For example, the model is now better at recognizing their mouths instead of mistaking them for noses. 🐶
<table class="center" style="width: 80%; margin-left: auto; margin-right: auto;">
<tr>
<td style="text-align: center"><b>Before vs. After (v1.1)</b></td>
</tr>
<tr>
<td style="border: none; text-align: center;">
<video controls loop src="https://github.com/user-attachments/assets/59fc09b9-6cb7-4265-833f-eebb27ed9511" muted="false" style="width: 60%;"></video>
</td>
</tr>
</table>
The new (v1.1) Animals model has been uploaded to [HuggingFace](https://huggingface.co/KwaiVGI/LivePortrait/tree/main/liveportrait_animals/base_models_v1.1) and is enabled by default.
> [!IMPORTANT]
> Note: Make sure to update your weights to use the new version.
If you prefer to use the original version, simply modify the configuration in [inference_config.py](../../../src/config/inference_config.py#L29):
```python
version_animals = "" # old version
# version_animals = "_v1.1" # new (v1.1) version
```
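
To satisfy the weight-update note above, one option is to pull only the new `base_models_v1.1` directory from the HuggingFace repo linked in this changelog. A minimal, hedged sketch using `huggingface_hub` (the local destination `pretrained_weights` is an assumption about your layout, not part of this release):

```python
# Hedged sketch: fetch only the v1.1 animal base models from the HuggingFace repo.
# Assumes your local weights live under ./pretrained_weights, mirroring the repo layout.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="KwaiVGI/LivePortrait",
    local_dir="pretrained_weights",
    allow_patterns=["liveportrait_animals/base_models_v1.1/*"],  # only the new weights
)
```

After the download, the paths composed with `version_animals = "_v1.1"` in `inference_config.py` should resolve to these files.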

Binary file not shown (new image asset, 54 KiB).

@@ -1,6 +1,7 @@
 # coding: utf-8
 """
-for human
+The entrance of humans
 """
 import os

@@ -1,6 +1,7 @@
 # coding: utf-8
 """
-for animal
+The entrance of animal
 """
 import os

@@ -41,6 +41,7 @@
 ## 🔥 Updates
+- **`2025/01/01`**: 🐶 We updated a new version of the Animals model with more data, see [**here**](./assets/docs/changelog/2025-01-01.md).
 - **`2024/10/18`**: ❗ We have updated the versions of the `transformers` and `gradio` libraries to avoid security vulnerabilities. Details [here](https://github.com/KwaiVGI/LivePortrait/pull/421/files).
 - **`2024/08/29`**: 📦 We update the Windows [one-click installer](https://huggingface.co/cleardusk/LivePortrait-Windows/blob/main/LivePortrait-Windows-v20240829.zip) and support auto-updates, see [changelog](https://huggingface.co/cleardusk/LivePortrait-Windows#20240829).
 - **`2024/08/19`**: 🖼️ We support **image driven mode** and **regional control**. For details, see [**here**](./assets/docs/changelog/2024-08-19.md).

@@ -26,10 +26,12 @@ class InferenceConfig(PrintableConfig):
 checkpoint_S: str = make_abs_path('../../pretrained_weights/liveportrait/retargeting_models/stitching_retargeting_module.pth') # path to checkpoint to S and R_eyes, R_lip
 # ANIMAL MODEL CONFIG, NOT EXPORTED PARAMS
-checkpoint_F_animal: str = make_abs_path('../../pretrained_weights/liveportrait_animals/base_models/appearance_feature_extractor.pth') # path to checkpoint of F
-checkpoint_M_animal: str = make_abs_path('../../pretrained_weights/liveportrait_animals/base_models/motion_extractor.pth') # path to checkpoint pf M
-checkpoint_G_animal: str = make_abs_path('../../pretrained_weights/liveportrait_animals/base_models/spade_generator.pth') # path to checkpoint of G
-checkpoint_W_animal: str = make_abs_path('../../pretrained_weights/liveportrait_animals/base_models/warping_module.pth') # path to checkpoint of W
+# version_animals = "" # old version
+version_animals = "_v1.1" # new (v1.1) version
+checkpoint_F_animal: str = make_abs_path(f'../../pretrained_weights/liveportrait_animals/base_models{version_animals}/appearance_feature_extractor.pth') # path to checkpoint of F
+checkpoint_M_animal: str = make_abs_path(f'../../pretrained_weights/liveportrait_animals/base_models{version_animals}/motion_extractor.pth') # path to checkpoint pf M
+checkpoint_G_animal: str = make_abs_path(f'../../pretrained_weights/liveportrait_animals/base_models{version_animals}/spade_generator.pth') # path to checkpoint of G
+checkpoint_W_animal: str = make_abs_path(f'../../pretrained_weights/liveportrait_animals/base_models{version_animals}/warping_module.pth') # path to checkpoint of W
 checkpoint_S_animal: str = make_abs_path('../../pretrained_weights/liveportrait/retargeting_models/stitching_retargeting_module.pth') # path to checkpoint to S and R_eyes, R_lip, NOTE: use human temporarily!
 # EXPORTED PARAMS
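
If the f-strings are hard to read in diff form: the new `version_animals` flag simply appends a suffix to the base-model directory. A standalone sketch of the path composition (plain strings instead of the repo's `make_abs_path` helper, which is simplified here for brevity):

```python
# Sketch only: how the version suffix selects which weight directory is used.
version_animals = "_v1.1"  # set to "" to fall back to the original weights

base_dir = f"pretrained_weights/liveportrait_animals/base_models{version_animals}"
checkpoint_F_animal = f"{base_dir}/appearance_feature_extractor.pth"
checkpoint_M_animal = f"{base_dir}/motion_extractor.pth"

print(checkpoint_F_animal)
# pretrained_weights/liveportrait_animals/base_models_v1.1/appearance_feature_extractor.pth
```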

@@ -207,12 +207,10 @@ class Cropper(object):
 vy_ratio=crop_cfg.vy_ratio,
 flag_do_rot=crop_cfg.flag_do_rot,
 )
-lmk = self.human_landmark_runner.run(frame_rgb, lmk)
-ret_dct["lmk_crop"] = lmk
 # update a 256x256 version for network input
 ret_dct["img_crop_256x256"] = cv2.resize(ret_dct["img_crop"], (256, 256), interpolation=cv2.INTER_AREA)
-ret_dct["lmk_crop_256x256"] = ret_dct["lmk_crop"] * 256 / crop_cfg.dsize
+ret_dct["lmk_crop_256x256"] = ret_dct["pt_crop"] * 256 / crop_cfg.dsize
 trajectory.frame_rgb_crop_lst.append(ret_dct["img_crop_256x256"])
 trajectory.lmk_crop_lst.append(ret_dct["lmk_crop_256x256"])
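
The removed lines dropped the extra human-landmark refinement pass for animal frames, so the 256x256 landmarks are now rescaled directly from the detector's `pt_crop` output. A minimal sketch of that rescaling step (NumPy arrays and the function name are illustrative assumptions, not the repo's exact code):

```python
import numpy as np

def rescale_to_network_input(pt_crop: np.ndarray, dsize: int) -> np.ndarray:
    # Landmarks are expressed in the dsize x dsize crop; the network consumes a
    # 256x256 crop, so coordinates scale by the same factor as the image resize.
    return pt_crop * 256 / dsize

# Example: with a 512x512 crop, a point at (256, 256) maps to (128, 128).
print(rescale_to_network_input(np.array([[256.0, 256.0]]), dsize=512))
```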