How to Generate Text in Python

Text is my personal favorite medium for machine learning. Here is why: In computing, a picture is worth a (few hundred) thousand words. As a result, modeling text is more space and compute efficient than visual models. Text arrived first to the internet. This lead time has resulted in better algorithms, and bottomless data. Interpretability…

Read More


Exploring Latent Word Vectors using Path Finding

In my previous post, I created a product recommendation system using word embeddings. Today, we’ll take it a step further and explore how we can use these vectors to find the shortest path between pairs of words. Dijkstra’s algorithm Dijkstra’s algorithm is a method for finding the shortest path between any two vertices of a…

Read More


Understanding Evolutionary Algorithms in Python

In this post, we will be diving into the world of context-free grammars to use in evolutionary algorithms. What is context-free grammar? Context-free grammar, also known as CFG, is a specific type of grammar used in computer science and linguistics. It is a set of production rules that define the structure of a language. CFG…

Read More


How to Build a Product Recommendation System in Python

Want to boost sales and stand out from the competition? Implementing a recommendation system can be the key. In this blog post, we’ll show you how to build a product recommendation system using the Python programming language. We’ll be using a modified version of the popular word2vec algorithm to measure the similarity between products, helping…

Read More


Tokenization in Python Using SentencePiece

What is tokenization? Tokenization involves breaking text into individual words, making it easier for computers to understand and analyze meaning. This task applies to various Natural Language Processing (NLP) applications such as language translation, text summarization, and sentiment analysis. In this post, we will explore using SentencePiece, a widely used open-source library for tokenization in…

Read More