Deep learning

Discover how LayerMix, an innovative data augmentation technique using structured fractal mixing, enhances deep learning model robustness against corruptions, adversarial attacks, and distribution shifts. Learn about its methodology, benchmarks, and results.

LayerMix: A Fractal-Based Data Augmentation Strategy for More Robust Deep Learning Models

Introduction: The Quest for Robust AI Deep Learning (DL) has revolutionized computer vision, enabling machines to identify objects, segment images, and drive cars with astonishing accuracy. Yet, a critical Achilles’ heel remains: these models often fail dramatically when faced with data that deviates even slightly from their training set. A self-driving car trained on sunny-day […]

LayerMix: A Fractal-Based Data Augmentation Strategy for More Robust Deep Learning Models Read More »

Pixel-Level Concrete Crack Quantification: A Breakthrough in Structural Health Monitoring

Pixel-Level Concrete Crack Quantification: A Breakthrough in Structural Health Monitoring

Concrete cracks are more than just surface imperfections—they’re early warning signs of structural degradation that can compromise the safety and longevity of buildings, bridges, roads, and other critical infrastructure. Traditional inspection methods often rely on manual assessments, which are time-consuming, subjective, and prone to human error. However, recent advancements in computer vision and deep learning

Pixel-Level Concrete Crack Quantification: A Breakthrough in Structural Health Monitoring Read More »

Discrete Migratory Bird Optimizer with Transfer Learning Aided Multi-Retinal Disease Detection

Discrete Migratory Bird Optimizer with Deep Transfer Learning for Multi-Retinal Disease Detection

Retinal diseases such as diabetic retinopathy (DR), age-related macular degeneration (AMD), and glaucoma are leading causes of irreversible vision loss worldwide. Early detection is critical to preventing permanent blindness, yet manual diagnosis remains time-consuming and subjective. Recent advances in artificial intelligence have paved the way for automated, high-accuracy diagnostic systems. Among them, a groundbreaking approach—Discrete

Discrete Migratory Bird Optimizer with Deep Transfer Learning for Multi-Retinal Disease Detection Read More »

Anchor-Based Knowledge Distillation (AKD), a breakthrough in trustworthy AI for efficient model compression.

Anchor-Based Knowledge Distillation: A Trustworthy AI Approach for Efficient Model Compression

In the rapidly evolving field of artificial intelligence (AI), knowledge distillation (KD) has emerged as a cornerstone technique for compressing powerful, resource-intensive neural networks into smaller, more efficient models suitable for deployment on mobile and edge devices. However, traditional KD methods often fall short in capturing the full richness of a teacher model’s knowledge, especially

Anchor-Based Knowledge Distillation: A Trustworthy AI Approach for Efficient Model Compression Read More »

Diagram showing REM (Routing Entropy Minimization) applied to a Capsule Network, reducing unnecessary parse trees and focusing only on relevant object parts.

Capsule Networks Do Not Need to Model Everything: How REM Reduces Entropy for Smarter AI

In the fast-evolving world of deep learning, capsule networks (CapsNets) have emerged as a promising alternative to traditional convolutional neural networks (CNNs). Unlike CNNs, which lose spatial hierarchies due to pooling layers, CapsNets aim to preserve part-whole relationships through dynamic routing mechanisms. However, despite their biological inspiration and theoretical advantages, CapsNets often struggle with over-complication—modeling

Capsule Networks Do Not Need to Model Everything: How REM Reduces Entropy for Smarter AI Read More »

RoofSeg: An edge-aware transformer-based network for precise roof plane segmentation from LiDAR point clouds

RoofSeg: Revolutionizing Roof Plane Segmentation with Edge-Aware Transformers

RoofSeg: A Breakthrough in End-to-End Roof Plane Segmentation Using Transformers In the rapidly evolving field of 3D urban modeling and geospatial analysis, roof plane segmentation plays a pivotal role in reconstructing detailed building models at Levels of Detail (LoD) 2 and 3. Traditionally, this process has relied on manual feature engineering or post-processing techniques like

RoofSeg: Revolutionizing Roof Plane Segmentation with Edge-Aware Transformers Read More »

Visual representation of ACAM-KD framework showing student-teacher cross-attention and dynamic masking for improved knowledge distillation in object detection and segmentation.

ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation

In the rapidly evolving world of deep learning, deploying high-performance models on resource-constrained devices remains a critical challenge—especially for dense visual prediction tasks like object detection and semantic segmentation. These tasks are essential in real-time applications such as autonomous driving, video surveillance, and robotics. While large, deep neural networks deliver impressive accuracy, their computational demands

ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation Read More »

GeoSAM2 architecture diagram showing multi-view processing with SAM2 and LoRA modules.

GeoSAM2 3D Part Segmentation — Prompt-Controllable, Geometry-Aware Masks for Precision 3D Editing

In the rapidly evolving field of computer vision and 3D modeling, 3D part segmentation has emerged as a critical yet challenging task. Whether for robotic manipulation, 3D content generation, or interactive editing, accurately segmenting 3D objects into their constituent parts is essential. However, traditional methods often rely on extensive manual labeling, slow per-shape optimization, or lack fine-grained

GeoSAM2 3D Part Segmentation — Prompt-Controllable, Geometry-Aware Masks for Precision 3D Editing Read More »

A medical AI system using YOLOv8 and hyperparameter optimization to detect coronary artery stenosis in invasive coronary angiography images.

Hyperparameter Optimization of YOLO Models for Invasive Coronary Angiography Lesion Detection

Revolutionizing Cardiac Care: How Hyperparameter Optimization Boosts YOLO Accuracy in Coronary Lesion Detection Cardiovascular diseases remain the leading cause of death worldwide, with coronary artery disease (CAD) at the forefront. Early and accurate detection of coronary stenosis—narrowing of the arteries supplying the heart—is critical for timely intervention and improved patient outcomes. While invasive coronary angiography

Hyperparameter Optimization of YOLO Models for Invasive Coronary Angiography Lesion Detection Read More »

Diagram illustrating the FRIES framework for estimating inconsistency in saliency metrics across deep learning models and perturbations.

FRIES: A Groundbreaking Framework for Inconsistency Estimation of Saliency Metrics

Unlocking Trust in AI: Introducing FRIES – The First Framework for Inconsistency Estimation of Saliency Metrics As artificial intelligence (AI) becomes increasingly embedded in high-stakes domains like healthcare, finance, and autonomous systems, the need for explainable AI (XAI) has never been greater. One of the most widely used tools in XAI is the saliency map,

FRIES: A Groundbreaking Framework for Inconsistency Estimation of Saliency Metrics Read More »

Follow by Email
Tiktok