Abstract: As legislation develops over time, the burden upon businesses can change drastically. Data scientists from Bardess have collaborated with a research team within the Government of Ontario to investigate the use of advanced natural language processing (NLP) and machine learning (ML) techniques to analyze legal documents including statutes and regulations. Using the Accessibility for Ontarians with Disabilities Act (AODA) as a starting point, we developed a multi-stage analysis. On the higher level, the goal was to simply identify and automatically detect parts of the legislature that indicate legislative burden and categorize them as being primarily burdens upon business or government departments. The second level of analysis aims at understanding patterns of similarities and differences between different classes of burden using data mining and clustering techniques. Finally, the objective of the analysis is expanded to include other legislative texts, using ML algorithms to detect burdens which have been duplicated across multiple statutes and acts. This latter work supports the Government of Ontario to develop leaner legislature more efficiently. Overall this work indicates how NLP and ML techniques can be brought to bear on complex legislative problems, further emphasizing the increasing utility of these techniques in government and industry.
In this hands-on workshop, we'll first describe the legislative/business context for the initiative, then walk attendees through the technical implementation. The work will be conducted by combining various techniques from the NLP toolbox, such as entity recognition, part-of-speech tagging, automatic summarization, and topic modeling. Work will be conducted in Python, making use of libraries for NLP such as spacy and nltk, and the ML library scikit-learn. We will also showcase interactive dashboards which have been created using the BI tool Qlik to allow exploration of the results of the analysis.
Bio: Dr. Daniel Parton leads the data science practice at the analytics consultancy, Bardess. He has a background in academia, including a PhD in computational biophysics from University of Oxford, and previously worked in marketing analytics at Omnicom. He brings both technical and management experience to his role of leading cross-functional data analytics teams, and has led successful and impactful projects for companies in finance, retail, tech, media, manufacturing, pharma and sports/entertainment industries.