Work in Progress
Until Friday, I was at Google DeepMind Japan (working from anywhere) and attended the Masason Foundation's year-end party, so my participation in NeurIPS was limited to the two workshop days on Saturday and Sunday. This summary therefore focuses mainly on those two days.
Sabyasachi Sahoo / ULaval+Mila
Kazuto Fukuchi / UTsukuba
I will present a work "Mastering Task Arithmetic: τJp as a Key Indicator for Weight Disentanglement" at the #NeurIPS2024 workshop FITML on 14th Dec.
We mitigate the interference between task vectors and coefficient tuning costs through the "τJp" regularization. pic.twitter.com/weaj9Ba75v
— Kotaro Yoshida @NeurIPS2024 (@katoro13___) December 12, 2024
WIP
I will present our work on the efficient distributed training algorithm at the optimization workshop. Join us during our poster sessions from 15:00-16:00. #NeurIPS2024 pic.twitter.com/D4nA8hWX4u
— Hiroki Naganuma (@_Hiroki11x) December 15, 2024
SOAP: Improving and Stabilizing Shampoo using Adam
μLO: Compute-Efficient Meta-Generalization of Learned Optimizers
WIP
I want to thank Microsoft Research (YVR->SFO) and the Masason Foundation (SFO->NRT->YVR) for supporting my participation in NeurIPS.