Hacker News

VeRA: Vector-Based Random Matrix Adaptation

by egnehots on 1/18/2024, 6:44:58 AM with 2 comments

by egnehots on 1/18/2024, 6:44:58 AM
VeRA makes LoRA ~10x more parameter efficient while retaining the same performance.
It's somewhat like a recursive LoRA scheme: the LoRA A and B matrices are frozen, randomly initialized, and shared across layers, and only two small scaling vectors are trained per layer.
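For anyone who wants to see the shape of the idea, here is a rough pure-Python sketch, not the authors' code. It assumes the update has the form diag(b) · B · diag(d) · A, with A and B frozen random matrices and only the vectors d and b trained. The helper names, the Gaussian initialization of A and B, the 0.1 starting value for d, and the toy dimensions are all my own assumptions for illustration.

```python
import random


def lora_trainable_params(d_in, d_out, rank):
    # LoRA trains both low-rank factors: A (rank x d_in) and B (d_out x rank).
    return rank * (d_in + d_out)


def vera_trainable_params(d_out, rank):
    # VeRA-style: only the scaling vectors are trained,
    # d (length rank) and b (length d_out). A and B stay frozen.
    return rank + d_out


def matvec(matrix, vec):
    # Plain matrix-vector product on nested lists.
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]


def vera_delta(A, B, d_vec, b_vec, x):
    # Computes diag(b) @ B @ diag(d) @ A @ x, the adapter's contribution
    # that would be added to the frozen layer output W0 @ x.
    h = matvec(A, x)                            # length rank
    h = [di * hi for di, hi in zip(d_vec, h)]   # scale by d
    h = matvec(B, h)                            # length d_out
    return [bi * hi for bi, hi in zip(b_vec, h)]  # scale by b


def make_frozen(rows, cols, rng):
    # Assumption: Gaussian entries; the real method may use a different init.
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
d_in, d_out, rank = 8, 6, 4          # toy sizes, chosen arbitrarily

A = make_frozen(rank, d_in, rng)     # frozen, never updated
B = make_frozen(d_out, rank, rng)    # frozen, never updated
d_vec = [0.1] * rank                 # trainable (assumed starting value)
b_vec = [0.0] * d_out                # trainable, zero so the adapter starts as a no-op

x = [rng.gauss(0.0, 1.0) for _ in range(d_in)]
delta = vera_delta(A, B, d_vec, b_vec, x)

print("adapter output at init:", delta)
print("LoRA trainable params (1024x1024, r=16):", lora_trainable_params(1024, 1024, 16))
print("VeRA-style trainable params (1024 out, r=256):", vera_trainable_params(1024, 256))
```

With b starting at zero the adapter contributes nothing at initialization, so fine-tuning begins from the pretrained model's behaviour. The counts printed at the end are per-layer arithmetic only; the actual savings depend on the ranks and layers chosen.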
by code-master on 1/18/2024, 10:59:41 AM
Is the code available anywhere?