Search

My Learning Corner
My Learning Corner
  • Machine Learning
  • Deep Learning
  • Personal
  • Data Science
  • System Design
  • Data Structures and Algo
  • SQL
  • Soft Skills
  • Courses
  • Local Links
    Hugo Reference Interview Tracker In Progress Activities Notion WebSpace Video Collections Classroom Task Tracker Resource List
  • External Links
    LeetCode TopCoder Project Euler Safari Online Educative Uplevel Interview Kickstart Coursera PluralSight LinkedIn Learning
  • Light Dark Automatic

Coding/DataScience/SQL

Dec 22, 2024
Tokenization

The process of breaking a sentence into smaller pieces, or tokens, is called tokenization. The tokens help the model understand the text better. For a sentence like, IBM taught me tokenization, tokens can be IBM, taught, me and tokenization. Different AI models might use different types of tokens. The program that breaks down text into individual tokens is called a tokenizer. Tokenizers generate tokens primarily through three tokenization methods, word based, character based, and subword based.

Nov 1, 2023
VSCode Reference

Visual Studio Code

Oct 29, 2023
Asserts in Python

Asserts in Python

Oct 29, 2023
TMUX cheatsheet

Tmux Common Commands

Oct 28, 2023
See all

System Design

Dec 22, 2024
Tokenization

The process of breaking a sentence into smaller pieces, or tokens, is called tokenization. The tokens help the model understand the text better. For a sentence like, IBM taught me tokenization, tokens can be IBM, taught, me and tokenization. Different AI models might use different types of tokens. The program that breaks down text into individual tokens is called a tokenizer. Tokenizers generate tokens primarily through three tokenization methods, word based, character based, and subword based.

Nov 1, 2023
VSCode Reference

Visual Studio Code

Oct 29, 2023
Asserts in Python

Asserts in Python

Oct 29, 2023
TMUX cheatsheet

Tmux Common Commands

Oct 28, 2023
See all

Posts

Mar 1, 2025 120 min read
Behavioral Questions

Behavioral Questions

Mar 1, 2025 42 min read
Behavioral Questions

Behavioral Questions

Sep 23, 2023 1 min read
Dashboard

Solutions by type

Jul 22, 2023 3 min read
Machine Learning Engineer and MLOPs Interview Prepation Guide

First blog

May 31, 2023 2 min read
DataIku Certification Quiz

DataIKU Core Designer Quiz

See all posts

Popular Topics

assert BFS data_structure_algo DFS Djikstra DP feature_store graphs hugo interview leetcode metric ML MLOps pipelines, gcp, mlops python Quiz, Dataiku Recommender System system design system_design

Published with Wowchemy — the free, open source website builder that empowers creators.

Cite
Copy Download