feat: image-driven animation, regional animation, docs

zhangdingyun 2024-08-19 17:21:49 +08:00
parent d1cb6bbc27
commit 424a5c74e7
14 changed files with 55 additions and 4 deletions

app.py

@@ -401,9 +401,10 @@ with gr.Blocks(theme=gr.themes.Soft(font=[gr.themes.GoogleFont("Plus Jakarta San
             examples=[
                 [osp.join(example_portrait_dir, "s13.mp4")],
                 # [osp.join(example_portrait_dir, "s18.mp4")],
-                [osp.join(example_portrait_dir, "s20.mp4")],
+                # [osp.join(example_portrait_dir, "s20.mp4")],
                 [osp.join(example_portrait_dir, "s29.mp4")],
                 [osp.join(example_portrait_dir, "s32.mp4")],
+                [osp.join(example_video_dir, "d3.mp4")],
             ],
             inputs=[retargeting_input_video],
             cache_examples=False,

assets/docs/changelog/2024-08-19.md

@@ -0,0 +1,49 @@
# Image-driven animation and regional animation mode
Inspired by [ComfyUI-AdvancedLivePortrait](https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait) ([@PowerHouseMan](https://github.com/PowerHouseMan)), we have added image-driven portrait animation, image-driven portrait video editing, and a regional animation mode. We have also improved the absolute driving mode.
## Arguments Description
You can modify the argument options in `src/config/argument_config.py`. Below we explain the newly added arguments and the arguments that may be confusing.
### Image-driven portrait animation and portrait video editing ###
- **driving**: You can set the `driving` option to the path of a driving image, as below:
```bash
python inference.py -d assets/examples/driving/d30.jpg
```
You can also upload the driving image to the corresponding location in the Gradio interface.
<p align="center">
<img src="../image-driven-portrait-animation-2024-08-19.jpg" alt="LivePortrait" width="960px">
<br>
Image-driven portrait animation
<br><br>
<img src="../image-driven-portrait-video-editing-2024-08-19.jpg" alt="LivePortrait" width="960px">
<br>
Image-driven portrait video editing
</p>
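For example, a minimal command-line sketch of image-driven portrait video editing (the `-s` source flag and the example paths are assumptions for illustration; substitute your own files):
```bash
# Source is a video, driving is an image: every output frame
# is driven by the motion of the single driving image.
python inference.py -s assets/examples/source/s13.mp4 -d assets/examples/driving/d30.jpg
```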
- **flag_relative_motion**: If the driving input is an image and `flag_relative_motion` is set to true, the driving motion is the motion deformation between the driving image and its canonical image. If it is set to false, the driving motion is the absolute motion of the driving image, which may produce stronger expression driving but also a certain degree of identity leakage. `flag_relative_motion` corresponds to the `relative motion` option in the Gradio interface.
In addition, if both the source input and the driving input are images, the output is an image. If the source input is a video and the driving input is an image, the result is a video in which the driving motion of every frame is provided by the driving image. In the Gradio interface, the output is automatically saved in the corresponding format (image or video) and displayed in the corresponding window.
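As a sketch of how the two modes are selected on the command line (the `--flag_relative_motion` spelling is assumed to mirror the argument name; `--no_flag_relative_motion` is shown later in this doc):
```bash
# Relative driving (default): motion deformation w.r.t. the canonical image
python inference.py -d assets/examples/driving/d30.jpg --flag_relative_motion

# Absolute driving: stronger expression transfer, possible identity leakage
python inference.py -d assets/examples/driving/d30.jpg --no_flag_relative_motion
```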
### Regional animation mode ###
- **animation_region**: This argument has five options: `exp` drives the source input with only the expression of the driving input, `pose` with only the head pose, `lip` with only the lip movement, `eyes` with only the eye movement, and `all` with all motions of the driving input. Alternatively, you can select the corresponding option in the following area of the Gradio interface; a command-line sketch follows the figure below.
<p align="center">
<img src="../regional-animation-2024-08-19.jpg" alt="LivePortrait" width="659px">
<br>
Regional animation mode
</p>
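A command-line sketch of regional animation (the `--animation_region` flag name is assumed from the argument name above; the driving path is illustrative):
```bash
# Transfer only the lip motion of the driving input to the source
python inference.py -d assets/examples/driving/d3.mp4 --animation_region lip
```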
### Editing the lip region of the source video to a neutral expression ###
Some issues asked that, in the `Retargeting Video` tab of the Gradio interface, the lip region of the source video could be edited to a neutral expression rather than only closed. We therefore provide a `keeping the lip silent` option: when selected, the lip region of the animated video takes on the neutral expression. Since this uses a mode similar to absolute driving, it may cause inter-frame jitter or identity leakage in the animated video. Note that the neutral expression sometimes has a slightly open mouth.
### More explanation ###
When both the source input and the driving input are videos, some issues reported that the output motion is a mixture of the source and driving video motions. This is because `flag_relative_motion` is enabled by default, i.e., we use relative rather than absolute driving. With `flag_relative_motion`, the motion offset of the current driving frame relative to the first driving frame is added to the motion of the source frame to form the final driving motion; with `--no_flag_relative_motion`, the motion of the current driving frame is used directly as the final driving motion.
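In symbols (notation ours, for illustration), with `x_{d,i}` the motion of driving frame `i`, `x_{d,0}` that of the first driving frame, and `x_s` that of the source frame:
```latex
\hat{x}_i = x_s + (x_{d,i} - x_{d,0})  % relative driving (default)
\hat{x}_i = x_{d,i}                    % absolute driving (--no_flag_relative_motion)
```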
So if you want to keep only the motion of the driving video in the animated video, you can use:
```bash
python inference.py --no_flag_relative_motion
```
Or you can uncheck the `relative motion` option in the Gradio interface. Using absolute driving may cause jitter or identity leakage in the animated video.

9 binary image files added (sizes from 29 KiB to 1.5 MiB); not shown.

README.md

@@ -38,6 +38,7 @@
 ## 🔥 Updates
+- **`2024/08/19`**: 🖼️ We support **image-driven and regional control**, inspired by [ComfyUI-AdvancedLivePortrait](https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait). See [**here**](./assets/docs/changelog/2024-08-19.md).
 - **`2024/08/06`**: 🎨 We support **precise portrait editing** in the Gradio interface, inspired by [ComfyUI-AdvancedLivePortrait](https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait). See [**here**](./assets/docs/changelog/2024-08-06.md).
 - **`2024/08/05`**: 📦 Windows users can now download the [one-click installer](https://huggingface.co/cleardusk/LivePortrait-Windows/blob/main/LivePortrait-Windows-v20240806.zip) for Humans mode and **Animals mode**! For details, see [**here**](./assets/docs/changelog/2024-08-05.md).
 - **`2024/08/02`**: 😸 We released a version of the **Animals model**, along with several other updates and improvements. Check out the details [**here**](./assets/docs/changelog/2024-08-02.md)!

src/utils/cropper.py
@@ -66,14 +66,14 @@ class Cropper(object):
             providers=face_analysis_wrapper_provider,
         )
         self.face_analysis_wrapper.prepare(ctx_id=device_id, det_size=(512, 512), det_thresh=self.crop_cfg.det_thresh)
-        # self.face_analysis_wrapper.warmup()
+        self.face_analysis_wrapper.warmup()

         self.human_landmark_runner = HumanLandmark(
             ckpt_path=self.crop_cfg.landmark_ckpt_path,
             onnx_provider=device,
             device_id=device_id,
         )
-        # self.human_landmark_runner.warmup()
+        self.human_landmark_runner.warmup()

         if self.image_type == "animal_face":
             from .animal_landmark_runner import XPoseRunner as AnimalLandmarkRunner
@@ -83,7 +83,7 @@ class Cropper(object):
                 embeddings_cache_path=self.crop_cfg.xpose_embedding_cache_path,
                 flag_use_half_precision=kwargs.get("flag_use_half_precision", True),
             )
-            # self.animal_landmark_runner.warmup()
+            self.animal_landmark_runner.warmup()

     def update_config(self, user_args):
         for k, v in user_args.items():
Binary file not shown.