Tokenization in the context of Natural Language Processing (NLP) is the process of dividing text into smaller parts called tokens.
Transformer models cannot take raw strings as input; instead, they expect text that has been tokenized and encoded as numerical vectors.
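As a quick illustration, here is a minimal sketch of that pipeline using naive whitespace (word-level) tokenization and a toy vocabulary built from the text itself. Real tokenizers are far more sophisticated; the vocabulary and helper functions below are hypothetical, for illustration only.

```python
def tokenize(text):
    """Naive word-level tokenization: lowercase and split on whitespace."""
    return text.lower().split()

def build_vocab(tokens):
    """Toy vocabulary: map each unique token to an integer ID."""
    return {tok: i for i, tok in enumerate(sorted(set(tokens)))}

text = "Transformer models expect numbers not raw text"
tokens = tokenize(text)          # text -> tokens
vocab = build_vocab(tokens)      # tokens -> vocabulary
ids = [vocab[t] for t in tokens] # tokens -> numerical IDs

print(tokens)
# ['transformer', 'models', 'expect', 'numbers', 'not', 'raw', 'text']
print(ids)
# [6, 1, 0, 3, 2, 4, 5]
```

In practice a model ships with a fixed, pre-trained vocabulary rather than one built from the input, but the overall flow (text to tokens to IDs) is the same.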
There are several types of tokenization, such as word-level, subword, and character-level tokenization. We will discuss each of them in detail.