Vision-Dialog Navigation by Exploring Cross-Modal Memory

Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks

Vision-and-Dialog Navigation

IMRAM: Iterative Matching With Recurrent Attention Memory for Cross-Modal Image-Text Retrieval

Audio Clustering Explained with a DEMO

The Ex-Uber Data Scientist Who wants to simplify Data Science with Serverless Computing

Cooperative Vision-and-Dialog Navigation

Case study usBIM.geotwin: Multi-user navigation

Active visual information gathering for vision language navigation

Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations

NEW LLM & Knowledge-Graph Fusion: GIVE (UC Berkeley, Penn)

Paper explanation of Soft Expert Reward Learning for Vision-and-Language Navigation

4Seasons: A Cross-Season Dataset for Multi-Weather SLAM in Autonomous Driving

Smart Navigation - How AI Robots Understand and Explore Environments

UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation - ArXiv:2408.04