Compiling Source Model Tutorial

Deep Multi-Source Visual Fusion With Transformer Model for Video Content Filtering

Abstract: As YouTube content continues to grow, advanced filtering systems are crucial to ensuring a safe and enjoyable user experience. We present MFusTSVD, a multi-modal model for classifying ...

IEEE

Sound Source Localization Using Multi-Dictionary Orthogonal Matching Pursuit in Reverberant Environments

Abstract: Sound source localization in reverberant environments remains a challenging problem, particularly when precise position estimation is required. Existing DOA estimation methods, while ...

GitHub

HunyuanVideo: A Systematic Framework For Large Video Generation Model

We present HunyuanVideo, a novel open-source video foundation model that exhibits performance in video generation that is comparable to, if not superior to, leading closed-source models. In order to ...

Wall Street Journal

Nvidia Is Developing an AI Healthcare Model With Startup Abridge

Nvidia is joining with Abridge, maker of an AI note-taking app for doctors, to train an artificial intelligence model tailored for healthcare as it continues its push into the sector. The AI model is ...

Microsoft

RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Building software repositories typically requires significant manual effort. Recent advances in large language model (LLM) agents have accelerated automation in software engineering (SWE). We ...

VentureBeat

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

A joint research collaboration between researchers at the University of Illinois at Urbana-Champaign (UIUC), UC Berkeley, and the open source AI-native vector database platform Chroma unveiled Harness ...

CNBC

Model routing is a fix for AI overspending. That's a problem for OpenAI and Anthropic

Companies are shifting from running everything on the most powerful AI model to matching each task to the right one, a practice called model routing. The pressure for efficiency comes as large ...

GitHub

Ideogram 4: Open image model at the forefront of design

Ideogram 4 is Ideogram's first open-weight text-to-image model. It is a state-of-the-art foundation model trained from scratch — not a fine-tune of any existing model. It introduces a new structured ...

VentureBeat

Google's new open source Gemma 4 12B analyzes audio, video — and runs entirely locally on a typical 16GB enterprise laptop

Credit: VentureBeat made with OpenAI ChatGPT-Images-2.0 While many AI open source model providers are pursuing larger and more powerful models, Google is still giving attention to the smaller, more ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results