top of page
learn_data_science.jpg

Data Scientist Program

 

Free Online Data Science Training for Complete Beginners.
 


No prior coding knowledge required!

Importing Data in Python

In this blog, I'm going to show a tutorial on how to import data in python for different types of data files. Python offers modules that help to import different data with various files formats.

First, I going to import data with CVS format as shown in the code below.

The CSV format enables us to read each row in the file using a comma as a delimiter.


In the beginning, I open the file in "read-only" mode, affect the delimiter, and then I used a for loop to read each row from the CSV file.


import csv

with open("C:/Users/asus/Desktop/train.csv,'r') as custfile:
rows=csv.reader(custfile,delimiter=',')
for r in rows:
print(r)

Also, there is another method to import .csv files by using pandas as shown below:


import pandas as pd
import numpy as np
data = pd.read_csv("C:/Users/asus/Desktop/train.csv", index_col="Loan_ID")
data.head(10)

First, I import pandas and numpy, second, load the data with pd.read_csv module, then show the first 10 rows and the result shown below:


The second file format I import with python is .txt format. as shown in the next code. First I import the NumPy library, load the file using loadtxt module.

import numpy as np
filename = 'MNIST_header.txt'
data = np.loadtxt(filename, delimiter=',', skiprows=1, dtype=str)
print(data)

Pandas library can handle excel files using the read_excel module. As shown below the example that imports data from an excel file.

First, I import the pandas library, import the excel file by using the pd.ExcelFile module the use df.parse module with specifying "loan_ID"

and finally, I showed the first 10 lines.

import pandas as pd

df = pd.ExcelFile("C:/Users/asus/Desktop/train.xlsx")
data=df.parse("Loan_ID")
print(data.head(10))

Also in python, we can connect to database servers using a module called pyodbc. This method offers to us the opportunity to import data from relational sources using a SQL query. As shown below:

import pyodbc
sql_conn = pyodbc.connect("Driver={SQL Server};Server=serverName;UID=UserName;PWD=Password;Database=sqldb;")
data_sql = pd.read_sql_query(SQL QUERY’, sql_conn)
data_sql.head()
0 comments

Recent Posts

See All
bottom of page