top of page

Search


Bringing Foundation Models to Small Data
This article explores TabPFN, a transformer-based foundation model designed for small tabular datasets. Trained on millions of synthetic datasets generated via structural causal models, TabPFN learns to predict labels through in-context learning. It outperforms traditional methods like CatBoost and XGBoost in both speed and accuracy, while offering robustness, interpretability, and fine-tuning capabilities. A breakthrough in tabular ML, it redefines what's possible on structu

Juan Manuel Ortiz de Zarate
Apr 1111 min read
1 view


Diffusion Models: From Noise to Masterpiece
Explore how Diffusion Models are revolutionizing generative AI, from their mathematical foundations to applications in image and audio.

Juan Manuel Ortiz de Zarate
Mar 208 min read
4 views


The Brains Behind AI’s Evolution
Discover how neural networks power modern AI, from deep learning to generative models, shaping the future of technology and innovation.

Juan Manuel Ortiz de Zarate
Mar 149 min read
2 views


Diffusion LLM: Closer to Human Thought
SEDD redefines generative AI with human-like reasoning, enabling faster, high-quality text and code through discrete diffusion models.

Juan Manuel Ortiz de Zarate
Mar 79 min read
25 views


AI That Thinks Before It Speaks
Optimizing AI reasoning with adaptive test-time computation using recurrent depth transformers for smarter, efficient problem-solving.

Juan Manuel Ortiz de Zarate
Feb 199 min read
1 view


Saving AI from Itself: How to Prevent Model Collapse
Active Inheritance curates synthetic data to control LLM behavior, preventing AI model collapse and improving diversity, safety, and bias.

Juan Manuel Ortiz de Zarate
Feb 68 min read
21 views


The fundamental weapon against overfitting
A detailed guide on regularization techniques (L1, L2, Elastic Net, Dropout, Early Stopping) to prevent overfitting in machine learning mode

Juan Manuel Ortiz de Zarate
Oct 16, 202410 min read
12 views


Orca: The New LLM Teacher
Orca 2: A smaller AI model that rivals larger ones by mastering task-specific reasoning, achieving high performance with less computation.

Juan Manuel Ortiz de Zarate
Oct 9, 20249 min read
26 views


Harnessing the Power of Bagging in Ensemble Learning
Boost your model's accuracy with bagging! Learn how ensemble techniques can stabilize predictions and improve performance.

Juan Manuel Ortiz de Zarate
Aug 7, 202410 min read
8 views


Biases in LLMs
Explore the hidden biases in LLMs and their impact. The opinions of which sector of society are reflected in them?

Juan Manuel Ortiz de Zarate
Jul 17, 202410 min read
22 views


Retrieval Augmented Generation: Increasing knowledge of your LLM
Dive into the world of Retrieval-Augmented Generation! See how RAG transforms AI responses by blending retrieval with generation.

Juan Manuel Ortiz de Zarate
May 25, 20249 min read
24 views


The Mathematics of Language
Computers model text with vectors. Using Word2Vec, FastText, and Transformers, they understand and generate context-aware text. Learn how!

Juan Manuel Ortiz de Zarate
May 25, 20248 min read
29 views


MLFlow + Hydra: A Framework for Experimentation with Python
In this article I share a experimentation framework I work with in my daily job. It uses MLFlow and Hydra to facilitate hypothesis testing.
Cristian Cardellino
May 23, 20249 min read
42 views


The Age of Digital Deception: The Dangers of Deep Fakes
Recent years have birthed a dangerous usage of the technology offered by neural networks, the deep fakes. We'll explore them in this article
Cristian Cardellino
Apr 27, 202413 min read
14 views


Generative Adversarial Networks (GANs): A Comprehensive Exploration
Let's explore Generative Adversarial Networks (GANs), one of the pioneers of Gen AI that was capable of producing photorealistic images.
Cristian Cardellino
Apr 10, 202411 min read
15 views


A Brief Introduction to Mixtures-of-Experts
In this article, we will explore the Mixture-of-Experts (MoE) and discuss the idea behind the gating mechanism used by the Sparse MoE.
Cristian Cardellino
Mar 26, 20248 min read
38 views


Variational Autoencoders: Intuitions and Math
In this article we will explore variational autoencoders in terms of the intuition behind them and the math needed to train them.
Cristian Cardellino
Mar 8, 20249 min read
33 views


A Brief Overview of Diffusion Models and Their Applications
In this article we will go through a brief review of diffusion models, how they work, what their usages are and what are the existing tools.
Cristian Cardellino
Feb 28, 20248 min read
38 views


Keeping Track of Experiments with MLFlow
This article introduces MLFlow and shows you how this great tool can help you boost your productivity in an experimentation environment.
Cristian Cardellino
Feb 16, 202411 min read
47 views


Beyond the Turing Test
Alan Turing, a British mathematician, logician, and computer scientist, played a seminal role in the development of artificial...

Claudio S. De Mutiis
Sep 17, 20238 min read
25 views
bottom of page