Vision Transformer Encoder/Decoder

GCP-VQVAE: A Geometry-Complete Language for Protein 3D Structure

Converting protein tertiary structure into discrete tokens via vector-quantized variational autoencoders (VQ-VAEs) creates a language of 3D geometry and provides a natural interface between sequence ...

WinBuzzer

Google DeepMind Launches D4RT AI Model for Real-Time 4D Reconstruction

Google DeepMind has released D4RT, a unified AI model for 4D scene reconstruction that runs 18 to 300 times faster than ...

Scientific Research Publishing

Geo-Refined Point Transformer: Coordinate-Aware Excitation and Positional Upsampling for 3D Scene Segmentation ()

The proposed Coordinate-Aware Feature Excitation (CAFE) module and Position-Aware Upsampling (Pos-Up) module both adhere to ...

TMCnet

Rail Vision: Quantum Transportation Delivers First Transformer-Based Neural Decoder for Universal Quantum Error Correction

Ra’anana, Israel, Jan. 15, 2026 (GLOBE NEWSWIRE) -- Rail Vision Ltd. (Nasdaq: RVSN) (“Rail Vision” or the “Company”), an early commercialization stage technology company seeking to revolutionize ...

15d

Show inaccessible results

GCP-VQVAE: A Geometry-Complete Language for Protein 3D Structure

Google DeepMind Launches D4RT AI Model for Real-Time 4D Reconstruction

Geo-Refined Point Transformer: Coordinate-Aware Excitation and Positional Upsampling for 3D Scene Segmentation ()

Rail Vision: Quantum Transportation Delivers First Transformer-Based Neural Decoder for Universal Quantum Error Correction

New Apple model combines vision understanding and image generation with impressive results

Transformer encoder architecture explained simply

These 20- and 22-year-olds raised $5M from YC, General Catalyst to study online behavior using vision AI