- https://research.google/blog/a-fast-wordpiece-tokenization-system/
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- https://github.com/huggingface/tokenizers
- https://www.tensorflow.org/text/guide/subwords_tokenizer