> Some interesting research has been done on the "loss landscape" of these giant... | Hacker News


catlifeonmars on Feb 17, 2024 | on: Training LLMs to generate text with citations via ...

> Some interesting research has been done on the "loss landscape" of these giant neural network models, and my understanding is that they are messy and complicated.

Do you have any recommended reading for this? It sounds like a super interesting area of research.

nerdponx on Feb 18, 2024

This is the one example I had in mind because of all the pretty pictures: https://arxiv.org/abs/1712.09913

Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer, Tom Goldstein. 2018. "Visualizing the Loss Landscape of Neural Nets".
