> Some interesting research has been done on the "loss landscape" of these giant neural network models, and my understanding is that they are messy and complicated.
Do you have any recommended reading for this? It sounds like a super interesting area of research.