Neural Networks | Transcendent AI

Bringing Foundation Models to Small Data

This article explores TabPFN, a transformer-based foundation model designed for small tabular datasets. Trained on millions of synthetic datasets generated via structural causal models, TabPFN learns to predict labels through in-context learning. It outperforms traditional methods like CatBoost and XGBoost in both speed and accuracy, while offering robustness, interpretability, and fine-tuning capabilities. A breakthrough in tabular ML, it redefines what's possible on structu

Juan Manuel Ortiz de Zarate

Apr 1111 min read

1 view

Diffusion Models: From Noise to Masterpiece

Explore how Diffusion Models are revolutionizing generative AI, from their mathematical foundations to applications in image and audio.

Juan Manuel Ortiz de Zarate

Mar 208 min read

4 views

The Brains Behind AI’s Evolution

Discover how neural networks power modern AI, from deep learning to generative models, shaping the future of technology and innovation.

Juan Manuel Ortiz de Zarate

Mar 149 min read

2 views

Diffusion LLM: Closer to Human Thought

SEDD redefines generative AI with human-like reasoning, enabling faster, high-quality text and code through discrete diffusion models.

Juan Manuel Ortiz de Zarate

Mar 79 min read

25 views

AI That Thinks Before It Speaks

Optimizing AI reasoning with adaptive test-time computation using recurrent depth transformers for smarter, efficient problem-solving.

Juan Manuel Ortiz de Zarate

Feb 199 min read

1 view

Saving AI from Itself: How to Prevent Model Collapse

Active Inheritance curates synthetic data to control LLM behavior, preventing AI model collapse and improving diversity, safety, and bias.

Juan Manuel Ortiz de Zarate

Feb 68 min read

21 views

The fundamental weapon against overfitting

A detailed guide on regularization techniques (L1, L2, Elastic Net, Dropout, Early Stopping) to prevent overfitting in machine learning mode

Juan Manuel Ortiz de Zarate

Oct 16, 202410 min read

12 views

Orca: The New LLM Teacher

Orca 2: A smaller AI model that rivals larger ones by mastering task-specific reasoning, achieving high performance with less computation.

Juan Manuel Ortiz de Zarate

Oct 9, 20249 min read

26 views

Harnessing the Power of Bagging in Ensemble Learning

Boost your model's accuracy with bagging! Learn how ensemble techniques can stabilize predictions and improve performance.

Juan Manuel Ortiz de Zarate

Aug 7, 202410 min read

8 views

Biases in LLMs

Explore the hidden biases in LLMs and their impact. The opinions of which sector of society are reflected in them?

Juan Manuel Ortiz de Zarate

Jul 17, 202410 min read

22 views

Retrieval Augmented Generation: Increasing knowledge of your LLM

Dive into the world of Retrieval-Augmented Generation! See how RAG transforms AI responses by blending retrieval with generation.

Juan Manuel Ortiz de Zarate

May 25, 20249 min read

24 views

The Mathematics of Language

Computers model text with vectors. Using Word2Vec, FastText, and Transformers, they understand and generate context-aware text. Learn how!

Juan Manuel Ortiz de Zarate

May 25, 20248 min read

29 views

MLFlow + Hydra: A Framework for Experimentation with Python

In this article I share a experimentation framework I work with in my daily job. It uses MLFlow and Hydra to facilitate hypothesis testing.

Cristian Cardellino

May 23, 20249 min read

42 views

The Age of Digital Deception: The Dangers of Deep Fakes

Recent years have birthed a dangerous usage of the technology offered by neural networks, the deep fakes. We'll explore them in this article

Cristian Cardellino

Apr 27, 202413 min read

14 views

Generative Adversarial Networks (GANs): A Comprehensive Exploration

Let's explore Generative Adversarial Networks (GANs), one of the pioneers of Gen AI that was capable of producing photorealistic images.

Cristian Cardellino

Apr 10, 202411 min read

15 views

A Brief Introduction to Mixtures-of-Experts

In this article, we will explore the Mixture-of-Experts (MoE) and discuss the idea behind the gating mechanism used by the Sparse MoE.

Cristian Cardellino

Mar 26, 20248 min read

38 views

Variational Autoencoders: Intuitions and Math

In this article we will explore variational autoencoders in terms of the intuition behind them and the math needed to train them.

Cristian Cardellino

Mar 8, 20249 min read

33 views

A Brief Overview of Diffusion Models and Their Applications

In this article we will go through a brief review of diffusion models, how they work, what their usages are and what are the existing tools.

Cristian Cardellino

Feb 28, 20248 min read

38 views

Keeping Track of Experiments with MLFlow

This article introduces MLFlow and shows you how this great tool can help you boost your productivity in an experimentation environment.

Cristian Cardellino

Feb 16, 202411 min read

47 views

Beyond the Turing Test

Alan Turing, a British mathematician, logician, and computer scientist, played a seminal role in the development of artificial...

Claudio S. De Mutiis

Sep 17, 20238 min read

25 views