Hello World! Today’s post is going to be computer science related. Recently I’ve started Data Science with Python track at DataCamp. DataCamp offers interactive courses with amazing instructors. All on topics in data science, statistics and machine learning.
In this post I will share my notes from the first course of Data Science with Python track. I am hoping that this post will share with you some basics of python for data science.
“Data Science”
You probably heard this term before, if not, don’t worry, now you do. 🙂
A nice definition of Data Science by University of Berkeley; Data science continues to evolve as one of the most promising and in-demand career paths for skilled professionals. Today, successful data professionals understand that they must advance past the traditional skills of analyzing large amounts of data, data mining, and programming skills.
If you are a visual learner like myself here is a 5 minute video by Simplilearn. The video contains a bit detailed information but it gives sense of the big picture.
I would like to write another post, what data science really is and why it is so important. But in this post it will more like me sharing my lecture notes from the first course.
Let’s get started!
What is List?
- Powerful
- Collection of values
- Holds different types
- Change, add, or remove elements
Changing list elements
Adding and removing elements
Functions
- Piece of reusable code
- Solves particular task
- Call function instead of writing the whole code yourself
Here are some built-in functions
- max(): Maximum of the list
- min(): Minimum of the list
- len(): length of the list or string
- index(): get index of the element in list. (this a method)
- reverse(): reverse a list. (this is a method)
List methods
String methods
Methods
- Everything = object
- Objects have methods associated, depending on type
Packages
- Directory of python scripts
- Each script is a module
- Specify functions, methods, and types
- Thousands of packages are available such as:
- NumPy
- Pandas
- Matplotlib
- Scikit-learn
How to use a package:
- Install package
- Import package
- Now you can use the package
Example use of NumPy:
We can also play on two dimensional arrays with NumPy:
Indexing in multi-dimensional NumPy arrays
We can also do statistical calculations using NumPy package (Such as: mean, median, correlation coefficient, standard deviation, sort, sum)
That’s all for now!
What are your thoughts on data science?
Did you know that Harvard University mentioned Data Science as “the sexiest job” of 21st century.