LLaMA Fine-Tuning Notes

These notes use the open-source lit-llama code at https://github.com/Lightning-AI/lit-llama/tree/main to run the fine-tuning.

Other references:

Accelerating LLaMA with Fabric: A Comprehensive Guide to Training and Fine-Tuning LLaMA – Lightning AI

Structured data example: BelleGroup/train_0.5M_CN · Datasets at Hugging Face


Fine-tuning notes

Fine-tuning on the Alpaca dataset

Download the code and set up the environment: https://github.com/Lightning-AI/lit-llama/blob/main/README.md

Prepare the model weights: https://github.com/Lightning-AI/lit-llama/blob/main/howto/download_weights.md

Fine-tune on the Alpaca dataset: https://github.com/Lightning-AI/lit-llama/blob/main/README.md

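Fine-tuning on a custom dataset

Prepare the dataset:

For a structured dataset (JSON records with "instruction" and "output" fields, as in the Alpaca format), copy lit-llama-main/scripts/prepare_alpaca.py and change the input file, the train/validation split, the save path, and so on.

For an unstructured dataset, copy lit-llama-main/scripts/prepare_any_text.py and change the corresponding settings. See https://github.com/Lightning-AI/lit-llama/blob/main/howto/unstructured_dataset.md for details.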
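Before adapting prepare_alpaca.py, it can help to check that the structured file really has the expected fields and to decide on the split size. The following is a minimal sketch, not the repository's code: it assumes the data is a single JSON array of Alpaca-style records, and the names `filter_records`, `load_records`, and `split_train_val` are hypothetical helpers introduced here for illustration.

```python
import json
import random


def filter_records(data):
    """Keep only records that have non-empty 'instruction' and 'output' fields."""
    return [r for r in data if r.get("instruction") and r.get("output")]


def load_records(path):
    """Load a JSON array of dicts from `path` (assumed layout) and filter it."""
    with open(path, encoding="utf-8") as f:
        return filter_records(json.load(f))


def split_train_val(records, val_fraction=0.1, seed=42):
    """Shuffle with a fixed seed and return (train, val) lists."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_val = int(len(shuffled) * val_fraction)
    return shuffled[n_val:], shuffled[:n_val]
```

If the file is stored as one JSON object per line instead of a single array, `load_records` would need to parse it line by line; the split logic stays the same.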

Training:

Edit the dataset path and the output path in lit-llama-main/finetune/lora.py, then run the script to start training.
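As an alternative to editing the paths inside the script, the run can be launched from a small wrapper. This is only a sketch under an assumption: it supposes that finetune/lora.py accepts `--data_dir` and `--out_dir` on the command line, mirroring the arguments of its `main` function. Check the script's signature in your checkout before relying on it; `build_finetune_command` is a hypothetical helper, and the paths in the usage comment are placeholders.

```python
import subprocess
import sys


def build_finetune_command(data_dir, out_dir, script="finetune/lora.py"):
    """Build the argument list for launching the LoRA fine-tuning script.

    The --data_dir / --out_dir flags are an assumption about the script's CLI.
    """
    return [sys.executable, script, "--data_dir", str(data_dir), "--out_dir", str(out_dir)]


def run_finetune(data_dir, out_dir):
    """Run the script from the repository root; raises if it exits with an error."""
    subprocess.run(build_finetune_command(data_dir, out_dir), check=True)


# Example (placeholder paths):
# run_finetune("data/mydata", "out/lora/mydata")
```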

Inference:

Edit the model path in lit-llama-main/generate/lora.py, then run it to generate text.

Bug-fix notes

Bug:

Running generate.py fails with: ValueError: Precision 'bf16-true' is invalid. Allowed precision values: ('16-mixed', 'bf16-mixed', '32-true', '64-true', 64, 32, 16, '64', '32', '16', 'bf16')

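Fix:

The installed lightning version is wrong. Reinstall it from the repository's requirements:

pip uninstall -y lightning; pip install -r requirements.txt

Alternatively, download the lightning source zip and install it with pip install lightning-master.zip -U
GitHub – Lightning-AI/lightning: Deep learning framework to train, deploy, and ship AI products Lightning fast.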

Reference:

ValueError: Precision 'bf16-true' is invalid. Allowed precision values: ('16-mixed', 'bf16-mixed', '32-true', '64-true', 64, 32, 16, '64', '32', '16', 'bf16') · Issue #241 · Lightning-AI/lit-gpt · GitHub
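Since this error comes down to which lightning build is installed, it is worth printing the installed versions before and after the reinstall. A minimal sketch using only the standard library; `installed_version` is a hypothetical helper name, and no particular version number is assumed to be the correct one.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Compare these against the pins in the repository's requirements.txt.
    for name in ("lightning", "torch"):
        print(name, installed_version(name))
```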

Bug: No module named 'torch.utils._device'

Fix: see No module named 'torch.utils._device' · Issue #135 · Lightning-AI/lit-llama · GitHub
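An error like this means the installed PyTorch build does not ship that module, which usually points to a version mismatch. The sketch below only checks whether a dotted module name can be found in the current environment, so you can confirm the situation before and after following the linked issue; `module_available` is a hypothetical helper, not part of lit-llama or PyTorch.

```python
import importlib.util


def module_available(dotted_name):
    """Return True if `dotted_name` can be located, False otherwise.

    find_spec imports parent packages, so a missing parent raises
    ModuleNotFoundError; that case is treated as "not available".
    """
    try:
        return importlib.util.find_spec(dotted_name) is not None
    except ModuleNotFoundError:
        return False


if __name__ == "__main__":
    print("torch.utils._device available:", module_available("torch.utils._device"))
```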

Bug: RuntimeError: CUDA error: device-side assert triggered. CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

Fix: scrolling further up the error log showed that the problem was max_seq_length. I located the function involved and passed it a larger max_seq_length.
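To see whether sequence length is the culprit before raising the limit, one option is to scan the tokenized samples for entries longer than max_seq_length. This is a minimal sketch on plain Python lists of token ids; `find_overlong` and `truncate` are hypothetical helpers, and how the samples are loaded depends on your own prepared dataset. As the error message itself suggests, setting the environment variable CUDA_LAUNCH_BLOCKING=1 before the run makes the traceback point at the failing call.

```python
def find_overlong(token_id_lists, max_seq_length):
    """Return (index, length) for every sample longer than max_seq_length."""
    return [
        (i, len(ids))
        for i, ids in enumerate(token_id_lists)
        if len(ids) > max_seq_length
    ]


def truncate(ids, max_seq_length):
    """Alternative to raising the limit: cut a sample down to max_seq_length tokens."""
    return ids[:max_seq_length]


if __name__ == "__main__":
    samples = [[1, 2, 3], [7] * 10, [5] * 4]
    print(find_overlong(samples, max_seq_length=4))
```

Raising max_seq_length, as done above, keeps the full samples at the cost of more memory; truncating keeps memory flat but drops the tail of long samples.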
