Character Encoding in NLP: The Role of ASCII and Unicode | by Javi Sánchez | Jan, 2023
A closer look at the technicalities and practical applications

In this article we cover character encoding standards, focusing on the ASCII and Unicode systems. We dive into how they work and their role in deep learning. We also provide some examples of character encoding using TensorFlow, to give an overview of how this library handles strings internally.

But first, we present some important concepts.

Character encoding is a system…