tasnim assali

Nov 20, 20212 min

Importing Data in Python

In this blog, I'm going to show a tutorial on how to import data in python for different types of data files. Python offers modules that help to import different data with various files formats.

First, I going to import data with CVS format as shown in the code below.

The CSV format enables us to read each row in the file using a comma as a delimiter.

In the beginning, I open the file in "read-only" mode, affect the delimiter, and then I used a for loop to read each row from the CSV file.

import csv
 

 
with open("C:/Users/asus/Desktop/train.csv,'r') as custfile:
 
rows=csv.reader(custfile,delimiter=',')
 
for r in rows:
 
print(r)

Also, there is another method to import .csv files by using pandas as shown below:

import pandas as pd
 
import numpy as np
 
data = pd.read_csv("C:/Users/asus/Desktop/train.csv", index_col="Loan_ID")
 
data.head(10)

First, I import pandas and numpy, second, load the data with pd.read_csv module, then show the first 10 rows and the result shown below:

The second file format I import with python is .txt format. as shown in the next code. First I import the NumPy library, load the file using loadtxt module.

import numpy as np
 
filename = 'MNIST_header.txt'
 
data = np.loadtxt(filename, delimiter=',', skiprows=1, dtype=str)
 
print(data)

Pandas library can handle excel files using the read_excel module. As shown below the example that imports data from an excel file.

First, I import the pandas library, import the excel file by using the pd.ExcelFile module the use df.parse module with specifying "loan_ID"

and finally, I showed the first 10 lines.

import pandas as pd
 

 
df = pd.ExcelFile("C:/Users/asus/Desktop/train.xlsx")
 
data=df.parse("Loan_ID")
 
print(data.head(10))
 

Also in python, we can connect to database servers using a module called pyodbc. This method offers to us the opportunity to import data from relational sources using a SQL query. As shown below:

import pyodbc
 
sql_conn = pyodbc.connect("Driver={SQL Server};Server=serverName;UID=UserName;PWD=Password;Database=sqldb;")
 
data_sql = pd.read_sql_query(SQL QUERY’, sql_conn)
 
data_sql.head()

    0