Recent Posts

[D] Why does BYOL/JEPA like models work? How does EMA prevent model collapse?

I am curious on your takes on BYOL/JEPA like training methods and the intuitions/mathematics behind why the hell does it work? From an optimization perspective, without the EMA parameterization of the teacher model, the task would be very trivial and it would lead to model collapse. However, EMA seems to avoid this. Why? Specifically: How can a network learn semantic …

Read More »

Fed responds to Trump effort to fire Lisa Cook

🔥 Trending now in : Fed responds to Trump effort to fire Lisa Cook Jerome Powell, chairman of the US Federal Reserve, left, and Lisa Cook, governor of the US Federal Reserve, during the Federal Reserve Board open meeting in Washington, DC, US, on Wednesday, June 25, 2025. Al Drago | Bloomberg | Getty Images The Federal Reserve on Tuesday …

Read More »