Finding Currajong - Fuzzy string searches in Python
With the advent of Big Data, data scrubbing (finding matches in imperfect data) has become an important issue. In this talk I will look at several algorithms for partial matching of strings and names and compare the strengths and weaknesses of each approach. We will also look at Python code and libraries for this work.
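As a rough illustration of what a partial string match can look like in Python, here is a minimal sketch. It is not taken from the talk. It assumes two common techniques: Levenshtein edit distance, written out by hand, and similarity ranking with the standard library's difflib.get_close_matches. The PLACES list and the closest_places helper are hypothetical names invented for this example.

```python
import difflib


def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions and substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


# Hypothetical sample data, for illustration only.
PLACES = ["Kurrajong", "Currarong", "Corryong", "Katoomba"]


def closest_places(query, places=PLACES, n=3, cutoff=0.6):
    """Rank candidate names by difflib similarity, ignoring case."""
    lowered = {place.lower(): place for place in places}
    hits = difflib.get_close_matches(query.lower(), list(lowered), n=n, cutoff=cutoff)
    return [lowered[hit] for hit in hits]


if __name__ == "__main__":
    print(levenshtein("Currajong", "Kurrajong"))
    print(closest_places("Currajong"))
```

Here levenshtein("Currajong", "Kurrajong") is 1, because a single substitution separates the two spellings. Edit distance gives an easily explained count of changes, while difflib returns a ratio between 0 and 1 that is convenient for ranking candidates; which one suits a given data set is the kind of trade-off the talk sets out to compare.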
Martin is a software developer, tech lead and manager at the Bureau of Meteorology. His passion is solving complex puzzles, whether that means finding elegant solutions to complex algorithmic problems, getting developers excited about code reviews or rewriting 10,000 lines of tangled Fortran code in a couple of hundred lines of Python.