Learning Vision-and-Language Navigation from YouTube Videos

Learning Vision-and-Language Navigation from YouTube Videos

ScreenAI: A Vision-Language Model for UI and Infographics UnderstandingПодробнее

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Cutting-Edge Machine Learning and Mathematical Theories in Vision-and-Language Navigation #shortsПодробнее

Cutting-Edge Machine Learning and Mathematical Theories in Vision-and-Language Navigation #shorts

Visual Perception Generalization for Vision and Language Navigation via Meta LearningПодробнее

Visual Perception Generalization for Vision and Language Navigation via Meta Learning

Visual Perception Generalization for Vision and Language Navigation via Meta LearningПодробнее

Visual Perception Generalization for Vision and Language Navigation via Meta Learning

History Enhanced and Order Aware Pre Training for Vision and Language NavigationПодробнее

History Enhanced and Order Aware Pre Training for Vision and Language Navigation

Vision and Language Navigation Based on Cross Modal Feature Fusion in Indoor EnvironmentПодробнее

Vision and Language Navigation Based on Cross Modal Feature Fusion in Indoor Environment

Vision and Language Navigation Based on Cross Modal Feature Fusion in Indoor EnvironmentПодробнее

Vision and Language Navigation Based on Cross Modal Feature Fusion in Indoor Environment

Grounded Entity-Landmark Adaptive Pre-Training for Vision-and-Language NavigationПодробнее

Grounded Entity-Landmark Adaptive Pre-Training for Vision-and-Language Navigation

History Enhanced and Order Aware Pre Training for Vision and Language NavigationПодробнее

History Enhanced and Order Aware Pre Training for Vision and Language Navigation

A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions & Imitation LearningПодробнее

A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions & Imitation Learning

PyAutoGUI - Locate anything on your screen | Simple Pyautogui projectПодробнее

PyAutoGUI - Locate anything on your screen | Simple Pyautogui project

Behavioral Analysis of Vision-and-Language Navigation AgentsПодробнее

Behavioral Analysis of Vision-and-Language Navigation Agents

[CVPR 2021 VQA2VLN Tutorial] Introduction to Vision Language NavigationПодробнее

[CVPR 2021 VQA2VLN Tutorial] Introduction to Vision Language Navigation

Visual Perception Generalization for Vision and Language Navigation via Meta LearningПодробнее

Visual Perception Generalization for Vision and Language Navigation via Meta Learning

CoRL 2020, Spotlight Talk 142: Sim-to-Real Transfer for Vision-and-Language NavigationПодробнее

CoRL 2020, Spotlight Talk 142: Sim-to-Real Transfer for Vision-and-Language Navigation

[CVPR 2023] Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene ObjectПодробнее

[CVPR 2023] Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object

History Enhanced and Order Aware Pre Training for Vision and Language NavigationПодробнее

History Enhanced and Order Aware Pre Training for Vision and Language Navigation

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action (CoRL 2022)Подробнее

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action (CoRL 2022)

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language NavigationПодробнее

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation