Marcel Castro

The Model Context Protocol - What and How to run in Snowflake

LLM, MCP, Snowflake

Introduction to MCP and how to run it in Snowflake.

May 29, 2025

DeepSeek-R1 and the aha-moment

LLM, RL, Reasoning

DeepSeek-R1 and the aha-moment - a brief description of DeepSeek-R1 achievements.

Jan 29, 2025

The Power of Focus: Understanding Attention Mechanisms in LLM

transformers, LLM

Large Language Model Optimization techniques - an overview of attention mechanisms

Jan 3, 2025

What KL Divergence has to do with Large Language Model

Regularization, LLM, transformers, DPO, SFT, RLHF

KL Divergence and its role on Large Language Model Training.

Nov 4, 2024

AI Agents Revolution

LLM, Agents

High level overview of AI Agents and evolutions.

Oct 23, 2024

Thinking Tokens

LLM, transformers, Tokens

Concept of thinking tokens to improve model performance while reasoning.

Oct 1, 2024

Overview of LLM Optimizations Techniques

machine_learning, transformers, LLM

Overview of current Large Language Model Optimization techniques for both inference and training

May 25, 2024

Understanding Mixture of Experts

sagemaker

transformers

NLP

MoE

Details on Mixture of Experts and how to run it.

May 19, 2024

Flash Attention - Fast and Memory Efficient Attention Mechanism

The attention layer is the main bottleneck in scaling longer sequences in LLM (Large Language Models), as its runtime and memory increase quadratically in the sequence…

Apr 7, 2024

How Attention Mechanism works in Transformers

NLP

transfomers

PyTorch

Simple implementation of self-attention using PyTorch.

Apr 1, 2024

Text Moderation - Toxicity Classification using Amazon Comprehend API

This notebook will capture different methods of performing text moderation, in special with focus on toxicity classification. It will be divided into 3 parts:

Mar 26, 2024

Train Falcon with near-linear scaling using Sharded Data Parallelism technique in SageMaker Model Parallelism Library

This notebook’s CI test result for us-west-2 is as follows. CI test results in other regions can be found at the end of the notebook.

Mar 11, 2024

Stable Diffusion on Amazon SageMaker

NLP

transfomers

stablediffusion

Journey to learn about transformer models in SageMaker. Inspired by philschmid.

Nov 4, 2022

Notes on Natural Langage Processing with Deep Learning

NLP

word2vector

A notebook that reproduces some of the teaching content from Stanford CS224n Natural Language Processing with Deep Learning

Jan 4, 2022

Multilingual Joint Image & Text Embeddings

%%capture
!pip install sentence-transformers

Jan 1, 2022

Joint Image & Text Embeddings

First we need to install sentence-transformers

Jan 1, 2022

Machine Learning Model Monitoring

data drift

model drift

MLOps

Notes on machine learning model monitoring concepts, challenges and howto.

Dec 13, 2021

Github Actions and Snowflake Integration

github

snowflake

CICD

Using github actions to create a CICD data pipeline in snowflake data.

Oct 31, 2021

Dimensionality Reduction - Non-negative Matrix Factorization - NMF

jupyter

NMF

Dimensionality

TF-IDF

PMI

A notebook to evaluate topics on non-conformance reports.

Oct 25, 2021

Generative Deep Learning

machine_learning

This repository is intended as place to keep my current experimentations on generative deep learning using tensorflow.

Sep 22, 2021

Principal Component Analysis - PCA

jupyter

Dimensionality

PCA

Step-by-step use of PCA for dimensionality reduction.

Sep 21, 2021

Using Keras Tuner for hyperparameter tunning

kerastuner

tensorflow

Adaptation from notebook from C3_W1_Lab_1_Keras_Tuner from DeepLearningAi MLOPs Specialization - Course 3

Sep 5, 2021

Feature Engineering with Tensorflow

jupyter

tensorflow

Notebook from C2W1_Assignment lab assignment from DeepLearningAi MLOPs Specialization - Course 2 -

Sep 4, 2021

My Title for Template

jupyter

Dimensionality

PCA

template

This is a template description.

Jan 1, 2020

Categories