Vision Transformers & Attention

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles

Leave a Comment / Machine Learning, Image Segmentation, Medical AI, Vision Transformers & Attention / Adnan Saeed

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Medical AI & Weakly-Supervised Learning · arXiv:2402.10887 · University of Oxford / Mianyang Visual Engineering Center · 25 min read Teaching Three Different Brains to Agree — How Weak-Mamba-UNet Segments Hearts […]

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles Read More »

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference

Leave a Comment / Machine Learning, Computer Vision, Vision Transformers & Attention / Adnan Saeed

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference | AI Trend Blend AITrendBlend Machine Learning NLP & LLMs About Efficient AI · arXiv:2603.15569 · CMU & Princeton · March 2026 · 22 min read Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference Time Researchers at

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference Read More »

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection

Leave a Comment / Machine Learning, Computer Vision, Remote Sensing AI, Vision Transformers & Attention / Adnan Saeed

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Autonomous Driving AI · ISPRS Journal of Photogrammetry and Remote Sensing 236 (2026) 640–653 · 22 min read GateMamba: How Three Gated Mixers Taught a Mamba Network to Stop Ignoring Cyclists

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection Read More »

The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction

Leave a Comment / Machine Learning, Computer Vision, Multimodal AI, Remote Sensing AI, Vision Transformers & Attention / Adnan Saeed

The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Planetary AI & 3D Reconstruction · ISPRS J. Photogramm. Remote Sens. 236 (2026) 363–379 · TU Dortmund University · 26 min read The Moon’s Many Faces: How One Transformer Learned to Speak All

The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction Read More »

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection

Leave a Comment / Machine Learning, Computer Vision, Multimodal AI, Vision Transformers & Attention / Adnan Saeed

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Computer Vision · arXiv:2404.09146 · Beihang University · 21 min read Mamba Goes Multimodal: How Fusion-Mamba Built a Hidden State Space to End Modality Disparity Researchers at Beihang University asked what happens when you stop treating

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection Read More »

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem

Leave a Comment / Machine Learning, Computer Vision, Medical AI, Vision Transformers & Attention / Adnan Saeed

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem | AI Medical Research AIMedical Research Machine Learning Medical AI About Medical Image AI · Expert Systems With Applications 321 (2026) 132169 · 16 min read BGPANet: The Bi-Granular Attention Breakthrough That Finally Taught AI to Diagnose Skin Cancer Like a Dermatologist How a

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem Read More »

CFFormer: Cross CNN-Transformer Attention Model

CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem

1 Comment / Machine Learning, Image Segmentation, Medical AI, Vision Transformers & Attention / Adnan Saeed

CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem | AI Trend Blend AITrendBlend Machine Learning Computer Vision Medical AI About Medical Image Segmentation · Expert Systems with Applications · 2025 · 24 min read CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem Researchers at University of Nottingham Ningbo built

CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem Read More »

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation

Leave a Comment / Machine Learning, Image Segmentation, Medical AI, Vision Transformers & Attention / Adnan Saeed

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Medical Computer Vision · Computational Visual Media (2026) · 18 min read PraNet-V2: How Dual-Supervised Reverse Attention Finally Fixes Background Blindness in Medical Segmentation Researchers at Nankai University tore apart the reverse attention mechanism they invented five

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation Read More »