Measuring string similarity in BigQuery using SQL | by Romain Granger | Jun, 2022
Use Levenshtein distance to discover similar or duplicated values, clean your data, and more!Photo by Suraj Kardile on UnsplashUsing the Levenshtein distance methodThis method can be used among others (Soundex, LIKE statement, Regexp) to perform string similarity or string matching in order to identify two elements (text, strings, inputs) that are similar but not identical.This method can be used for a variety of applications, including identifying duplicates, handling misspelled user input data, cleaning customer data,…