info
Thank you for visiting my website!

Python Libraries


Stats

StatsModels

Most commonly used for statistical analysis.


Stats

SciPy

Most commonly used for statistical analysis.


Imbalanced Data

imbalanced-learn

Most commonly used for statistical analysis to deal with imbalanced datasets.


ML

scikit-learn

Library used for predictive data analysis (machine learning).


Language

NLTK/spaCy

Language processing to parse through words and sentences.


Maps

Folium Package

This library is used for visualizing map data.


Visualize

Seaborn Package

Most commonly used to visualize data - can be used in conjunction with matplotlib.


Visualize

matplotlib

This library is used for visualization of information, usually used in conjuction with NumPy for the mathematical extension.


Vectorize

NumPy Library

This library is most commonly used for working with vectors (i.e. DataFrames) usually in conjunction with Pandas.


Data

Pandas Library

Pandas is a library in python that has functionality for data manipulation and analysis. It allows us to organize lists and store them as DataFrames. DataFrames are data structures which have labeled rows and columns. We want to create these dataframes to then be able to edit, manipulate, and analyze the data in a clear way.


Automation

Selenium Library

Selenium is an open-source code initially written by Baiju Muthukadan. It allows you to send Python commands to the web. This means it will let you access the web from your Python program and let it automate things you normally would need to do yourself, such as tap on buttons, enter content in structures, check whether everything is okay with your site.

Selenium is mostly used as a testing package, however, I delved into using it to send commands to a webpage that needed a location to be entered before data could be taken. This is what I learnt: