Fix llama's lm_head.weight.requires_grad (#330)
By default, `llama's lm_head.weight.requires_grad` was True, but we expect it to be False.pull/332/head
parent
7a37513f77
commit
47a2b1ee65
Loading…
Reference in New Issue