From 6a7efe27422b43f163e67932b63714abc20f1057 Mon Sep 17 00:00:00 2001
From: Jianzhu Guo <guojianzhu@kuaishou.com>
Date: Mon, 5 Aug 2024 18:04:26 +0800
Subject: [PATCH] feat: update env, doc, one-installer (#283)

* chore: remove transformers for macOS

* chore: specify torch cuda version

* chore: update install instruction

* chore: update install instruction

* chore: update readme

* doc: update

---------

Co-authored-by: zzzweakman <1819489045@qq.com>
---
 assets/docs/changelog/2024-08-02.md |  7 ++---
 assets/docs/changelog/2024-08-05.md | 18 +++++++++++++
 readme.md                           | 40 ++++++++++++++++++++++++-----
 requirements.txt                    |  2 ++
 requirements_base.txt               |  6 +----
 requirements_macOS.txt              |  3 +++
 6 files changed, 61 insertions(+), 15 deletions(-)
 create mode 100644 assets/docs/changelog/2024-08-05.md
diff --git a/assets/docs/changelog/2024-08-02.md b/assets/docs/changelog/2024-08-02.md
index 7503311..17b75a4 100644
--- a/assets/docs/changelog/2024-08-02.md
+++ b/assets/docs/changelog/2024-08-02.md
@@ -13,14 +13,14 @@
 </table>
 
 
-🎉 We are excited to announce the release of a new version featuring animals mode, along with several other updates. Special thanks to the dedicated efforts of the LivePortrait team. 💪
+🎉 We are excited to announce the release of a new version featuring animals mode, along with several other updates. Special thanks to the dedicated efforts of the LivePortrait team. 💪 We also provided an one-click installer for Windows users, checkout the details [here](./2024-08-05.md).
 
 ### Updates on Animals mode
 We are pleased to announce the release of the animals mode, which is fine-tuned on approximately 230K frames of various animals (mostly cats and dogs). The trained weights have been updated in the `liveportrait_animals` subdirectory, available on [HuggingFace](https://huggingface.co/KwaiVGI/LivePortrait/tree/main/) or [Google Drive](https://drive.google.com/drive/u/0/folders/1UtKgzKjFAOmZkhNK-OYT0caJ_w2XAnib). You should [download the weights](https://github.com/KwaiVGI/LivePortrait?tab=readme-ov-file#2-download-pretrained-weights) before running. There are two ways to run this mode.
 
 > Please note that we have not trained the stitching and retargeting modules for the animals model due to several technical issues. _This may be addressed in future updates._ Therefore, we recommend **disabling stitching by setting the `--no_flag_stitching`** option when running the model. Additionally, `paste-back` is also not recommended.
 
-Before launching, ensure you have installed `transformers==4.22.0`, `pillow>=10.2.0`, which are already updated in  [`requirements_base.txt`](../../../requirements_base.txt). We have chosen [XPose](https://github.com/IDEA-Research/X-Pose) as the keypoints detector for animals. This relies on `transformers` and requires building an OP named `MultiScaleDeformableAttention` by
+Before launching, ensure you have installed `transformers==4.22.0`, `pillow>=10.2.0`, which are already updated in  [`requirements_base.txt`](../../../requirements_base.txt). We have chosen [X-Pose](https://github.com/IDEA-Research/X-Pose) as the keypoints detector for animals. This relies on `transformers` and requires building an OP named `MultiScaleDeformableAttention` by
 ```bash
 cd src/utils/dependencies/XPose/models/UniPose/ops
 python setup.py build install
@@ -38,7 +38,7 @@ python app_animals.py # --server_port 8889 --server_name "0.0.0.0" --share
 ```
 
 > [!WARNING]
-> [XPose](https://github.com/IDEA-Research/X-Pose) is only for Non-commercial Scientific Research Purposes, you should remove and replace it with other detectors if you use it for commercial purposes.
+> [X-Pose](https://github.com/IDEA-Research/X-Pose) is only for Non-commercial Scientific Research Purposes, you should remove and replace it with other detectors if you use it for commercial purposes.
 
 ### Updates on Humans mode
 
@@ -50,6 +50,7 @@ python app_animals.py # --server_port 8889 --server_name "0.0.0.0" --share
 
 - [**Poe supports LivePortrait**](https://poe.com/LivePortrait). Check out the news on [X](https://x.com/poe_platform/status/1816136105781256260).
 - [ComfyUI-LivePortraitKJ](https://github.com/kijai/ComfyUI-LivePortraitKJ) (1.1K 🌟) now includes MediaPipe as an alternative to InsightFace, ensuring the license remains under MIT and Apache 2.0.
+- [ComfyUI-AdvancedLivePortrait](https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait) features real-time portrait pose/expression editing and animation, and is registered with ComfyUI-Manager.
 
 
 
diff --git a/assets/docs/changelog/2024-08-05.md b/assets/docs/changelog/2024-08-05.md
new file mode 100644
index 0000000..a265210
--- /dev/null
+++ b/assets/docs/changelog/2024-08-05.md
@@ -0,0 +1,18 @@
+## One-click Windows Installer
+
+### Download the installer from HuggingFace
+```bash
+# !pip install -U "huggingface_hub[cli]"
+huggingface-cli download cleardusk/LivePortrait-Windows LivePortrait-Windows-v20240805.zip
+```
+
+If you cannot access to Huggingface, you can use [hf-mirror](https://hf-mirror.com/) to download:
+```bash
+# !pip install -U "huggingface_hub[cli]"
+export HF_ENDPOINT=https://hf-mirror.com
+huggingface-cli download cleardusk/LivePortrait-Windows LivePortrait-Windows-v20240805.zip
+```
+
+Alternatively, you can manually download it from the [HuggingFace](https://huggingface.co/cleardusk/LivePortrait-Windows/blob/main/LivePortrait-Windows-v20240805.zip) page.
+
+Then, simply unzip the package `LivePortrait-Windows-v20240805.zip` and double-click `run_windows_human.bat` for the Humans mode, or `run_windows_animal.bat` for the **Animals mode**.
diff --git a/readme.md b/readme.md
index dec8d85..3c0be3e 100644
--- a/readme.md
+++ b/readme.md
@@ -37,8 +37,8 @@
 </p>
 
 
-
 ## 🔥 Updates
+- `2024/08/05`: 📦 Windows users download the [one-click installer](https://huggingface.co/cleardusk/LivePortrait-Windows/blob/main/LivePortrait-Windows-v20240805.zip) for Humans mode and **Animals mode** now! For details, see [**here**](./assets/docs/changelog/2024-08-05.md).
 - **`2024/08/02`**: 😸 We released a version of the **Animals model**, along with several other updates and improvements. Check out the details [**here**](./assets/docs/changelog/2024-08-02.md)!
 - **`2024/07/25`**: 📦 Windows users can now download the package from [HuggingFace](https://huggingface.co/cleardusk/LivePortrait-Windows/tree/main) or [BaiduYun](https://pan.baidu.com/s/1FWsWqKe0eNfXrwjEhhCqlw?pwd=86q2). Simply unzip and double-click `run_windows.bat` to enjoy!
 - **`2024/07/24`**: 🎨 We support pose editing for source portraits in the Gradio interface. We’ve also lowered the default detection threshold to increase recall. [Have fun](assets/docs/changelog/2024-07-24.md)!
@@ -57,6 +57,10 @@ We are actively updating and improving this repository. If you find any bugs or
 
 ## Getting Started 🏁
 ### 1. Clone the code and prepare the environment 🛠️
+
+> [!Note]
+> Make sure your system has [`git`](https://git-scm.com/), [`conda`](https://anaconda.org/anaconda/conda), and [`FFmpeg`](https://ffmpeg.org/download.html) installed. For details on FFmpeg installation, see [**how to install FFmpeg**](assets/docs/how-to-install-ffmpeg.md).
+
 ```bash
 git clone https://github.com/KwaiVGI/LivePortrait
 cd LivePortrait
@@ -64,16 +68,37 @@ cd LivePortrait
 # create env using conda
 conda create -n LivePortrait python=3.9
 conda activate LivePortrait
+```
 
-# for Linux and Windows users
+#### For Linux or Windows Users
+
+The [X-Pose](https://github.com/IDEA-Research/X-Pose) dependency has **strict limitations** on the CUDA version. To check your current CUDA version, run the following command:
+```bash
+nvcc -V # example versions: 11.1, 11.8, 12.1, etc.
+```
+We provide installation commands for `torch` corresponding to three common CUDA versions. If your version is not listed, please visit [PyTorch Official Website](https://pytorch.org/get-started/previous-versions/) to find the installation command for your CUDA version.
+```bash
+# for Linux and Windows users (choose one based on your CUDA version):
+# for CUDA 11.1
+pip install torch==1.10.1+cu111 torchvision==0.11.2 torchaudio==0.10.1 -f https://download.pytorch.org/whl/cu111/torch_stable.html
+# for CUDA 11.8
+pip install torch==2.3.0 torchvision==0.18.0 torchaudio==2.3.0 --index-url https://download.pytorch.org/whl/cu118
+# for CUDA 12.1
+pip install torch==2.3.0 torchvision==0.18.0 torchaudio==2.3.0 --index-url https://download.pytorch.org/whl/cu121
+# ...
+```
+Finally, install the remaining dependencies:
+```bash
 pip install -r requirements.txt
+```
+
+#### For macOS with Apple Silicon Users
+The [X-Pose](https://github.com/IDEA-Research/X-Pose) dependency does not support macOS, so you can skip its installation. While Humans mode works as usual, Animals mode is not supported. Use the provided requirements file for macOS with Apple Silicon:
+```bash
 # for macOS with Apple Silicon users
 pip install -r requirements_macOS.txt
 ```
 
-> [!Note]
-> Make sure your system has [`git`](https://git-scm.com/), [`conda`](https://anaconda.org/anaconda/conda), and [`FFmpeg`](https://ffmpeg.org/download.html) installed. For details on FFmpeg installation, see [**how to install FFmpeg**](assets/docs/how-to-install-ffmpeg.md).
-
 ### 2. Download pretrained weights 📥
 
 The easiest way to download the pretrained weights is from HuggingFace:
@@ -124,7 +149,7 @@ python inference.py -h
 ```
 
 #### Fast hands-on (animals) 🐱🐶
-Animals mode is ONLY tested on Linux with NVIDIA GPU.
+Animals mode is ONLY tested on Linux and Windows with NVIDIA GPU.
 
 You need to build an OP named `MultiScaleDeformableAttention` first, which is used by [X-Pose](https://github.com/IDEA-Research/X-Pose), a general keypoint detection framework.
 ```bash
@@ -207,6 +232,7 @@ The results are [**here**](./assets/docs/speed.md).
 Discover the invaluable resources contributed by our community to enhance your LivePortrait experience:
 
 - [ComfyUI-LivePortraitKJ](https://github.com/kijai/ComfyUI-LivePortraitKJ) by [@kijai](https://github.com/kijai)
+- [ComfyUI-AdvancedLivePortrait](https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait) by [@PowerHouseMan](https://github.com/PowerHouseMan).
 - [comfyui-liveportrait](https://github.com/shadowcz007/comfyui-liveportrait) by [@shadowcz007](https://github.com/shadowcz007)
 - [LivePortrait In ComfyUI](https://www.youtube.com/watch?v=aFcS31OWMjE) by [@Benji](https://www.youtube.com/@TheFutureThinker)
 - [LivePortrait hands-on tutorial](https://www.youtube.com/watch?v=uyjSTAOY7yI) by [@AI Search](https://www.youtube.com/@theAIsearch)
@@ -216,7 +242,7 @@ Discover the invaluable resources contributed by our community to enhance your L
 And many more amazing contributions from our community!
 
 ## Acknowledgements 💐
-We would like to thank the contributors of [FOMM](https://github.com/AliaksandrSiarohin/first-order-model), [Open Facevid2vid](https://github.com/zhanglonghao1992/One-Shot_Free-View_Neural_Talking_Head_Synthesis), [SPADE](https://github.com/NVlabs/SPADE), [InsightFace](https://github.com/deepinsight/insightface) repositories, for their open research and contributions.
+We would like to thank the contributors of [FOMM](https://github.com/AliaksandrSiarohin/first-order-model), [Open Facevid2vid](https://github.com/zhanglonghao1992/One-Shot_Free-View_Neural_Talking_Head_Synthesis), [SPADE](https://github.com/NVlabs/SPADE), [InsightFace](https://github.com/deepinsight/insightface) and [X-Pose](https://github.com/IDEA-Research/X-Pose) repositories, for their open research and contributions.
 
 ## Citation 💖
 If you find LivePortrait useful for your research, welcome to 🌟 this repo and cite our work using the following BibTeX:
diff --git a/requirements.txt b/requirements.txt
index 9717d4a..67c5c21 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -1,2 +1,4 @@
 -r requirements_base.txt
+
 onnxruntime-gpu==1.18.0
+transformers==4.22.0
diff --git a/requirements_base.txt b/requirements_base.txt
index 29106aa..e37f84a 100644
--- a/requirements_base.txt
+++ b/requirements_base.txt
@@ -1,6 +1,3 @@
-torch==2.3.0
-torchvision==0.18.0
-torchaudio==2.3.0
 numpy==1.26.4
 pyyaml==6.0.1
 opencv-python==4.10.0.84
@@ -18,5 +15,4 @@ imageio-ffmpeg==0.5.1
 tyro==0.8.5
 gradio==4.37.1
 pykalman==0.9.7
-transformers==4.22.0
-pillow>=10.2.0
+pillow>=10.2.0
\ No newline at end of file
diff --git a/requirements_macOS.txt b/requirements_macOS.txt
index 8225548..d4fa46b 100644
--- a/requirements_macOS.txt
+++ b/requirements_macOS.txt
@@ -1,4 +1,7 @@
 -r requirements_base.txt
 
 --extra-index-url https://download.pytorch.org/whl/cpu
+torch==2.3.0
+torchvision==0.18.0
+torchaudio==2.3.0
 onnxruntime-silicon==1.16.3