
feat: Add LoRA (Low-Rank Adaptation) support for efficient model fine-tuning#108

Open
chen2021673 wants to merge 7 commits into master from add_lora

Conversation

Contributor

chen2021673 commented Feb 12, 2026

Summary

Added LoRA (Low-Rank Adaptation) support for parameter-efficient fine-tuning. This feature significantly reduces the number of trainable parameters through low-rank decomposition, enabling efficient fine-tuning of large models.

Changes

New Features

LoRA Infrastructure (infini_train/include/nn/lora/):

  • lora_config.h/cc - LoRA configuration (rank, alpha, dropout)
  • lora_linear.h/cc - LoRA linear layer wrapper
  • lora_model.h/cc - Multi-LoRA layer management
  • lora_parallel_linear.h/cc - Tensor parallelism support
  • lora_utils.h/cc - Utility functions

Tests:

  • test/lora/test_lora.cc - Unit tests

Documentation:

  • docs/lora_usage.md - Usage documentation

Examples:

  • example/gpt2/main.cc - Added LoRA training example

Build:

  • CMakeLists.txt - Added test_lora build target

Test Result

Accuracy: (screenshot)
Performance: (screenshot)
llama3 output comparison: (screenshot)

chen2021673 and others added 5 commits February 12, 2026 09:11
- Add LoRA module infrastructure with configurable rank, alpha, dropout
- Implement LoRALinear wrapper for seamless integration with Linear layers
- Support tensor parallelism via LoRAParallelLinear
- Add LoRAModel utility for managing multiple LoRA layers
- Integrate LoRA configuration and utilities
- Add GPT2 example demonstrating LoRA fine-tuning
- Include comprehensive usage documentation and test suite

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Refactor LoRA config construction with proper target module parsing
- Add GetLoRAModel for in-place LoRA layer injection
- Fix DDP reducer to correctly handle LoRA parameters
- Fix RowParallel/ColumnParallel LoRA input handling to match base module behavior
- Add shape-based defensive checks for TP/SP consistency
- Move TP/SP communication helper function declarations to utils.h
- Move getter implementations from header to .cc file
- Add unit test for SaveLoRAWeights/LoadLoRAWeights functionality

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
// LoRA A: [rank, in_features] - replicated across TP ranks (implemented as Linear)
// LoRA B: [out_features_per_partition, rank] - sharded like base weight (implemented as ColumnParallelLinear with
// gather_output)
class LoRAColumnParallelLinear : public nn::CloneableModule<LoRAColumnParallelLinear> {
Contributor

Couldn't this inherit from the original ColumnParallelLinear? That would save some space on the base class's member definitions and getters.

Contributor Author

I'd recommend against inheritance here. LoRA layers an incremental update on top of the original Linear (a classic decorator: LoRA doesn't deal with the parallelism details and leaves communication to the base), so it isn't a new kind of ColumnParallelLinear; composition beats inheritance in this case. With inheritance, the base class and base_module_ would each maintain their own set of weight / flags, which could easily fall out of sync.

continue;
}

if (type == Linear::kType) {
Contributor

From this point on, this file has quite a few of these three-way if checks whose bodies differ only in the class name; it feels like a more elegant approach could be used.

Contributor Author

The most we can do here is extract a shared helper function to shrink the body of each if branch; since type is determined at runtime, templates can't reduce the number of branches. But this logic shouldn't grow any further, so I think it's acceptable.

2 participants