Research

The Trustworthy Knowledge-Driven AI (TKAI) Lab develops stable, explainable, and trustworthy AI systems by integrating ideas from neuroscience, formal language theory, control theory, information theory, scientific machine learning, and cognitive science. Our research combines mathematical foundations and empirical validation with the goal of building AI systems that learn from limited data, remain computationally efficient, generalize beyond training distributions, and provide transparent reasoning.

Our work spans five major directions:

Brain-inspired learning
Neural automata and theoretical AI
Neuro-symbolic reasoning and trustworthy language models
AI for scientific discovery
AI for health, biology, and vision

Brain-Inspired Learning and Predictive Coding

At TKAI Lab, we study learning algorithms inspired by biological intelligence. A central goal is to move beyond purely global backpropagation-based training by developing local, stable, and energy-based learning mechanisms.

Predictive Coding
OverviewPublications
Predictive coding models learning as an iterative process in which internal states are adjusted to reduce local prediction errors. We study predictive coding as an alternative framework for scalable deep learning, continual learning, robustness, and energy-based inference. Our work develops stability, convergence, and robustness guarantees for predictive coding networks, while also exploring their relationship to Minimum Description Length (MDL), active inference, and biologically plausible learning.
Local Representation Alignment
OverviewPublications
Local Representation Alignment (LRA) replaces global error propagation with locally defined target representations and feedback pathways. This line of work studies how deep and recurrent networks can learn using local objectives, error-feedback matrices, and layer-wise alignment mechanisms. We use LRA to understand alternatives to backpropagation in feedforward networks, recurrent models, compression systems, and continual learning.
Backpropagation-Free Reinforcement Learning
OverviewPublications
We explore reinforcement learning systems that use active neural generative coding and predictive coding instead of standard backpropagation through time. These models are motivated by the idea that agents can learn through local predictive objectives, internal state correction, and action-conditioned generative dynamics. This research connects predictive coding, active inference, and reinforcement learning in sparse-reward robotic control settings.
- Active Predictive Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems (2023)
  ICRA
- Backprop-Free Reinforcement Learning with Active Neural Generative Coding (2022)
  AAAI
Neuro-Mimetic and Continual Learning
OverviewPublications
We study how neural systems can learn continually without catastrophic forgetting. This includes predictive coding models, self-organizing maps, local learning, sparse representations, and task-free online learning. The broader goal is to design learning systems that remain adaptive over time while preserving previously acquired knowledge.

Neural Automata, Formal Languages, and Computational Theory

We develop theoretical foundations for understanding what neural networks can learn under finite precision, finite time, and practical computational constraints. This work connects recurrent networks, differentiable memory, automata theory, formal languages, and computational complexity.

Memory-Augmented Neural Networks
OverviewPublications
Transformers and recurrent networks often struggle to generalize to sequences longer than those seen during training. Our work develops memory-augmented neural networks, including differentiable stack and tape-based models, to study how neural architectures can recognize formal languages, simulate automata, and perform structured computation under finite precision and time.
- A Provably Stable Neural Network Turing Machine with Finite Precision and Time (2024)
  Information Sciences
- Exploring Learnability in Memory-Augmented Recurrent Neural Networks: Precision, Stability, and Empirical Insights (2024)
  arxiv
- Recognizing Long Grammatical Sequences Using Recurrent Networks Augmented with an External Differentiable Stack (2021)
  PMLR
- The Neural State Pushdown Automata (2021)
  IEEE Transactions on Artificial Intelligence
Formal Languages and Neural Learnability
OverviewPublications
We study recurrent neural networks through the lens of formal language recognition. This includes counter languages, Dyck languages, mathematical equation verification, and symbolic rule extraction. Our goal is to characterize when neural models can learn structured computation and when they collapse to simpler finite-state behavior.
Realizable Circuit Complexity
OverviewPublications
Standard circuit complexity often abstracts away physical constraints such as spatial embedding, communication delay, locality, and energy dissipation. We develop Realizable Circuit Complexity as a framework for understanding computation under physical and architectural constraints. This work provides a new lens for analyzing the limitations of attention, long-range computation, and scalable neural reasoning systems.
- Realizable Circuit Complexity: Embedding Computation in Space-Time (2025)
  arxiv
Finite Precision and Computational Limits
OverviewPublications
Neural networks are often analyzed as real-valued systems, but practical computation occurs under bounded precision and finite time. We study how these constraints change the expressivity, stability, and learnability of neural architectures. This research asks when neural models can approximate symbolic computation and when numerical constraints force them into simpler computational regimes.

Neuro-Symbolic AI and Trustworthy Language Models

We develop neuro-symbolic methods that combine neural representations with symbolic structure, automata-inspired memory, and interpretable reasoning. This work focuses on making language models more reliable, controllable, and transparent.

Local RetoMaton and Neuro-Symbolic Reasoning
OverviewPublications
Local RetoMaton extends retrieval-based language modeling with local, structured, automata-inspired memory. Instead of relying only on chain-of-thought prompting or in-context examples, this approach introduces traceable memory paths that can support more transparent and controllable reasoning. Our goal is to develop language models that can expose the reasoning structures they use and allow users to inspect, edit, and verify them.
LLM Generalization, Capacity, and MDL
OverviewPublications
We study large language model capacity, generalization, and optimization through Minimum Description Length, curvature analysis, influence-based layer allocation, and information-theoretic learning principles. This work aims to move model selection and adaptation from heuristic procedures toward principled optimization objectives.
- Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization (2026)
  arxiv
Educational and Human-Centered LLM Agents
OverviewPublications
We design LLM-based agents for education and human-centered reasoning. This includes teacher-student agent systems, retrieval-augmented learning environments, learning-style adaptation, and ethically grounded decision-making. The goal is to make AI agents useful, transparent, and safe in settings where human learning and trust are central.
- Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval Augmented Generation Across Learning Style (2025)
  EMNLP
- Ethical Decision-Making in the Age of AI: An Analysis of Large Language Models (2026)
  ACDSA

Robust, Safe, and Efficient Deep Learning

We develop mathematical and algorithmic tools for improving the stability, robustness, safety, and efficiency of deep learning systems.

Activation Functions and Stability Theory
OverviewPublications
Activation functions shape gradient flow, representation geometry, stability, and optimization dynamics. We study activation functions through theoretical and empirical lenses, including smoothness, asymptotic growth, Gaussian propagation, and stability signatures. This line of work includes the TeLU activation function and a broader 9-dimensional taxonomy for activation-induced behavior in deep networks.
- TeLU Activation Function for Fast and Stable Deep Learning (2025)
  arxiv
- Integral Signatures of Activation Functions: A 9-Dimensional Taxonomy and Stability Theory for Deep Learning (2025)
  arxiv
Information-Theoretic and MDL-Based Learning
OverviewPublications
We develop learning objectives based on information theory, free energy, and Minimum Description Length. These methods aim to improve robustness, calibration, and model selection by penalizing overly complex explanations and rare but severe failure modes. Surprisal-Rényi Free Energy provides a risk-sensitive objective that interpolates between different divergence behaviors.
Robustness, Continual Learning, and Unlearning
OverviewPublications
Reliable AI systems must adapt, forget, and remain robust under distribution shifts and adversarial conditions. We study continual learning, model unlearning, concept erasure, and robustness in generative models. This research develops evaluation methods and divergence-driven objectives for distinguishing true unlearning from concealment or superficial suppression.

AI for Scientific Discovery

We use AI to accelerate scientific discovery in biology, geoscience, physics, and engineering. Our emphasis is on models that combine data-driven learning with scientific structure, domain knowledge, and interpretable constraints.

Enzyme Kinetics and Protein Function Prediction
OverviewPublications
Enzyme-substrate interactions depend on protein sequence, molecular structure, substrate chemistry, and local binding geometry. We develop machine learning models for enzyme kinetics and catalytic variability using protein language models, substrate representations, and graph-based learning. This research supports biological discovery, metabolic engineering, and enzyme design.
- Robust Prediction of Enzyme Variant Kinetics with RealKcat (2025)
  bioRxiv
- Empirical and Computational Advances in Enzyme Kinetics Prediction: A Machine Learning Approach to Catalytic Variability (2025)
  AIChE Annual Meeting
Physics-Guided Machine Learning for Earthquake Prediction
OverviewPublications
We develop physics-guided machine learning models for laboratory earthquake prediction and fault-zone monitoring. This work combines active-source ultrasonic measurements, physics-informed learning, and geophysical domain knowledge to model frictional failure, permeability evolution, and acoustic precursors to lab earthquakes.
- Using a Physics-Informed Neural Network and Fault Zone Acoustic Monitoring to Predict Lab Earthquakes (2023)
  Nature Communications
- Physics-Guided Machine Learning for Laboratory Earthquake Prediction (2023)
  EGU General Assembly
- A Physics-Informed Machine Learning Model for Lab Earthquake Prediction Using Time-Lapse Active Source Ultrasonic Data (2022)
  AGU Fall Meeting
Geoscience, Permeability, and Seismo-THMC Systems
OverviewPublications
We apply machine learning to geophysical systems involving permeability evolution, microearthquakes, and coupled seismo-thermo-hydro-mechanical-chemical processes. This work aims to connect seismic signals, laboratory measurements, and physical mechanisms in order to better understand subsurface dynamics and enhanced geothermal systems.
- Crustal Permeability Generated Through Microearthquakes Is Constrained by Seismic Moment (2024)
  Nature Communications
- Evolution of Crustal Permeability as a Coupled Seismo-THMC Process (2025)
  AGU Fall Meeting

AI for Health, Vision, and Multimodal Learning

We develop AI systems for biomedical interpretation, visual reasoning, image compression, human motion prediction, and multimodal learning.

AI for Health and Clinical Interpretation
OverviewPublications
We study how language models and chain-of-thought prompting can support clinical interpretation and patient-centered communication. This includes work on pathology report summarization, cancer report interpretation, and safe deployment of AI systems in health-related workflows.
- Chain-of-Thought for Cancer Pathology Interpretation and Summarization (2025)
  SMARTCOMP
Microscopy and Human-in-the-Loop Vision-Language Models
OverviewPublications
We study human-in-the-loop methods for selecting prompts and improving microscopy vision-language models. This research focuses on making AI-assisted microscopy more efficient, interpretable, and useful for scientific image analysis.
- A Human-in-the-Loop Framework for Efficient Prompt Selection in Microscopy Vision-Language Models (2026)
  arxiv
Image Compression and Neural Decoding
OverviewPublications
We develop neural methods for image compression and iterative decoding. This includes recurrent learning algorithms, neural JPEG systems, and gradient communication between estimators. The broader goal is to improve compression efficiency while preserving reconstruction quality and computational tractability.
Computer Vision and Visually Grounded Learning
OverviewPublications
We study visual representation learning, visually grounded language acquisition, human motion prediction, and 3D room layout reconstruction. This work connects perception, language, geometry, and temporal modeling.

Patents and Technology Transfer

Neural Network Training Technology
OverviewPublications
Our research has contributed to patented technology on alternative methods for training neural networks. This reflects the translational potential of biologically motivated and backpropagation-free learning methods for real-world AI systems.
- Novel Method of Training a Neural Network, US Patent Application 17/520,448 (2023)
  US Patent Application

Additional Research Directions

High Performance Computing

OverviewPublications

We are exploring GPU-parallel algorithms for graph search and scientific computing. This includes parallel implementations of Bounded Multi-Source Shortest Path methods designed to decompose large search problems into independent subproblems suitable for modern accelerators.

Work in progress.
Control Systems

OverviewPublications

We study optimization-based approaches for adaptive control, including gradient-descent-based MRAC and higher-order extensions. This work connects control theory, stability analysis, and learning-based adaptation.

Work in progress.