Llama 3.2로 문장 생성 및 챗팅 완성 실습

Running Meta Llama on Linux 문서의 내용을 참고하여 Llama 3.2 1B 모델로 다음 두 가지 기능을 실습합니다.

문장 완성
챗팅 완성

실습 환경

Ubuntu 20.04.6 LTS
Python 3.12.7
Llama3.2-1B, Llama3.2-1B-Instruct
rustc 1.83.0
NVIDIA RTX 4090 24GB

프로그램 준비

실습에서 사용할 wget, md5sum 설치

sudo apt-get install wget
sudo apt-get install md5sum

NVIDIA GPU 설치 여부 확인
```
nvidia-smi
```

실습 디렉토리 만들기

mkdir llama3-demo
cd llama3-demo 
git clone https://github.com/meta-llama/llama3.git

Python 3.10 이상의 버전으로 가상환경 만들고 활성화
```
python -m venv llama-venv
. llama-venv/bin/activate
```
Rust 컴파일러 설치

How To Install Rust on Ubuntu 20.04 문서를 참고하여 Rust 컴파일러를 설치합니다.
```
curl --proto '=https' --tlsv1.3 https://sh.rustup.rs -sSf | sh
```
위 명령을 실행하면 아래와 같이 세 가지 선택 옵션이 나타나는데 그냥 엔터를 쳐서 1번 옵션으로 진행합니다.
```
...
1) Proceed with installation (default)
2) Customize installation
3) Cancel installation
```
아래 명령을 실행하여 현재 쉘에 반영하고 설치된 컴파일러 버전을 확인합니다.
```
source $HOME/.cargo/env
rustc --version
```
의존 라이브러리 설치
```
pip install -e .
```

모델 다운로드

Llama 웹 사이트에서 신청
- Request Access to Llama Models
선택 옵션에 따라 다운로드할 수 있는 모델들과 커스텀 URL이 화면에 표시되고 이메일로도 전송됩니다.

모델 목록 표시

llama model list

위 명령을 수행하면 아래와 같은 모델 목록이 표시됩니다.

+-----------------------------------------+-----------------------------------------------------+----------------+
| Model Descriptor                        | Hugging Face Repo                                   | Context Length |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-1B                             | meta-llama/Llama-3.2-1B                             | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-3B                             | meta-llama/Llama-3.2-3B                             | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-11B-Vision                     | meta-llama/Llama-3.2-11B-Vision                     | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-90B-Vision                     | meta-llama/Llama-3.2-90B-Vision                     | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-1B-Instruct                    | meta-llama/Llama-3.2-1B-Instruct                    | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-3B-Instruct                    | meta-llama/Llama-3.2-3B-Instruct                    | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-1B-Instruct:int4-qlora-eo8     | meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8     | 8K             |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-1B-Instruct:int4-spinquant-eo8 | meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8 | 8K             |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-3B-Instruct:int4-qlora-eo8     | meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8     | 8K             |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-3B-Instruct:int4-spinquant-eo8 | meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8 | 8K             |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-11B-Vision-Instruct            | meta-llama/Llama-3.2-11B-Vision-Instruct            | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+
| Llama3.2-90B-Vision-Instruct            | meta-llama/Llama-3.2-90B-Vision-Instruct            | 128K           |
+-----------------------------------------+-----------------------------------------------------+----------------+

모델 다운로드

llama download --source meta --model-id Llama3.2-1B
llama download --source meta --model-id Llama3.2-1B-Instruct

다운로드된 모델들은 아래 디렉토리에 저장됩니다.

$HOME/.llama/checkpoints

예제 실행

텍스트 생성 예제 실행
```
torchrun --nproc_per_node 1 example_text_completion.py --ckpt_dir $HOME/.llama/checkpoints/Llama3.2-1B/ --tokenizer_path $HOME/.llama/checkpoints/Llama3.2-1B/tokenizer.model --max_seq_len 128 --max_batch_size 4
```
example_text_completion.py 실행시 다음과 같은 에러가 발생합니다.
```
TypeError: ModelArgs.__init__() got an unexpected keyword argument 'use_scaled_rope'
```
이 문제를 해결하기 위하여 $HOME/.llama/checkpoints/Llama3.2-1B/params.json에서 "use_scaled_rope": true, 항목을 제거합니다.
- 참고 문서: unexpected keyword argument ‘use_scaled_rope’ error when running example script with Meta-Llama-3.1-8B #291

질의 응답 예제 실행

torchrun --nproc_per_node 1 example_chat_completion.py --ckpt_dir $HOME/.llama/checkpoints/Llama3.2-1B-Instruct/ --tokenizer_path $HOME/.llama/checkpoints/Llama3.2-1B-Instruct/tokenizer.model --max_seq_len 512 --max_batch_size 6

Written with StackEdit.

Intel MKL 예제를 Microsoft Visual C++로 빌드하기

Intel MKL 예제를 Microsoft Visual C++로 빌드하기 인텔 프로세서 시스템에서 아래의 영역에 해당하는 수학 계산을 빠르게 수행하고자 한다면 Intel MKL 라이브러리를 사용할 수 있습니다. Linear Algebra Fast Fourier Transforms (FFT) Vector Statistics & Data Fitting Vector Math & Miscellaneous Solvers 이 문서는 Intel MKL 이 제공하는 예제 파일을 Microsoft Visual C++ 로 컴파일하고 링크하여 실행 파일을 만드는 과정을 소개합니다. 빌드 환경 다음은 이 문서를 작성하는 과정에서 Intel MKL 예제를 빌드하기 위하여 사용한 환경입니다. 시스템 운영체제: Windows 10 (64비트) 프로세서: Intel Core i7 설치 제품 IDE: Microsoft Visual Studio Community 2019 (version 16) 라이브러리: Intel Math Kernel Library 2019 Update 5 환경 변수 명령 프롬프트 창을 엽니다. 아래 스크립트를 실행하여 환경 변수 INCLUDE , LIB , 그리고 PATH 를 설정합니다. @echo off set CPRO_PATH=C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows set MKLROOT=%CPRO_PATH%\mkl set REDIST=%CPRO_PATH%\redist set INCLUDE=%MKLROOT%\include;%INCLUDE% set LIB=%MKLROOT%\lib\intel64;%LIB% set PATH=%REDIST%\intel64\mkl;%PATH% REM for OpenMP intel thread set LIB=%CPRO_PATH%\compiler\lib...

자세한 내용 보기

천천히, 제대로

이 블로그 검색