Recent Posts

SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder

arXiv:2510.05081v1 (announce type: cross). Abstract: Large-scale text-to-image diffusion models have become the backbone of modern image editing, yet text prompts alone do not offer adequate control over the editing process. Two properties are especially desirable: disentanglement, where changing one attribute does not unintentionally alter others, and continuous control, where the strength of an edit can be smoothly adjusted. We introduce …

[2405.14715] Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models

Submitted on 23 May 2024 (v1); last revised 6 Oct 2025 (v3). By Young Kyun Jang and 1 other author. Abstract: Modern retrieval systems often struggle with upgrading to new and more powerful models due to the incompatibility of embeddings …

How AGI is building the tech for AI agents to book travel

The rise of artificial intelligence (AI) agents is poised to transform how travelers search, plan and book trips. Instead of clicking through endless websites and juggling multiple tabs, autonomous AI agents could soon handle the entire process: navigating booking flows, completing payments, applying loyalty points and even personalizing options based on the user’s preferences. San Francisco-based startup AGI, Inc. is reimagining …
