How to import data from Excel to DataSet in Python
The EasyXLS Excel library can be used to import Excel files with Python on Windows, Linux, Mac or other operating systems. The integration varies depending on the operating system and on whether the bridge for .NET Framework or for Java is chosen.
Step 1: Download and install EasyXLS
To download the trial version of the EasyXLS Excel Library, use the download button on the EasyXLS website.
If you already own a license key, you may log in and download EasyXLS from your account.
For v8.6 or earlier, run the downloaded EasyXLS installer.
Step 2: License file setup
This step is required for EasyXLS v9.0 or later.
If you are using a trial, generate a trial license file from the EasyXLS trials page. The trial license is valid for 30 days.
If you own a license key, you may log in to the account that purchased the license and generate the license file from: https://www.easyxls.com/my-orders
Set up the license file in your project by following the EasyXLS licensing guidelines.
Step 3: Install Pythonnet
To install Pythonnet, run the "pip" command as follows. Pip is a package-management system used to install and manage software packages written in Python.
<Python installation path>\Scripts>pip install "pythonnet.whl"
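Before moving on, it can help to confirm that the installation succeeded. The following is a minimal sketch, assuming that Pythonnet exposes the `clr` module (the module the tutorial code imports). The helper name `is_module_available` is hypothetical and not part of Pythonnet or EasyXLS.

```python
import importlib.util


def is_module_available(module_name):
    """Return True if the import system can locate module_name."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # "clr" is the module provided by Pythonnet
    if is_module_available("clr"):
        print("Pythonnet is installed.")
    else:
        print("Pythonnet was not found; re-run the pip command.")
```

The same check works for any other package by passing its import name.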
Step 4: Include EasyXLS library into project
EasyXLS.dll must be added to your project. EasyXLS.dll can be found:
- Inside the archive downloaded at Step 1, for EasyXLS v9.0 or later
- Under the installation path, in the "Dot NET version" folder, for EasyXLS v8.6 or earlier
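One possible way to make EasyXLS.dll visible to Pythonnet is to add its folder to `sys.path` before calling `clr.AddReference`. This sketch assumes Pythonnet searches that list when it resolves assembly names. The folder path and the helper name `add_assembly_folder` are hypothetical placeholders.

```python
import os
import sys


def add_assembly_folder(folder):
    """Append folder to sys.path (only once) so DLLs in it can be located."""
    folder = os.path.abspath(folder)
    if folder not in sys.path:
        sys.path.append(folder)
    return folder


# Hypothetical location; replace it with the folder that holds EasyXLS.dll.
add_assembly_folder(r"C:\EasyXLS")

# With the folder registered, the reference can then be added by name:
# import clr
# clr.AddReference('EasyXLS')
```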
Step 5: Run Python code that imports data from Excel to DataSet
Execute the following Python code that imports Excel data to DataSet.
"""----------------------------------------------------------
Tutorial 34
This tutorial shows how to import Excel to DataSet in Python.
The data is imported from the active sheet of the Excel file
(the Excel file generated in Tutorial 09).
----------------------------------------------------------"""
import clr
import gc

clr.AddReference('EasyXLS')
from EasyXLS import *

print("Tutorial 34\n-----------\n")

# Create an instance of the class that imports Excel files
workbook = ExcelDocument()

# Import Excel file to DataSet
print("Reading file C:\\Samples\\Tutorial09.xlsx.\n")
ds = workbook.easy_ReadXLSXActiveSheet_AsDataSet("C:\\Samples\\Tutorial09.xlsx")

# Display imported DataSet values
dt = ds.Tables[0]
for row in range(dt.Rows.Count):
    for column in range(dt.Columns.Count):
        # str() guards against non-string cell values (numbers, dates)
        print("At row " + str(row + 1) + ", column " + str(column + 1) +
              " the value is '" + str(dt.Rows[row].ItemArray[column]) + "'")

# Dispose memory
gc.collect()
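If the imported values are needed as plain Python objects rather than printed, the DataTable can be copied into a list of dictionaries. The following is a minimal sketch that relies only on the standard .NET `DataTable` members `Columns`, `ColumnName`, `Rows`, `Count` and `ItemArray`. The helper name `datatable_to_dicts` is hypothetical and not part of EasyXLS.

```python
def datatable_to_dicts(dt):
    """Copy a .NET DataTable-like object into a list of {column: value} dicts."""
    # Collect the column names once, in column order
    names = [dt.Columns[i].ColumnName for i in range(dt.Columns.Count)]
    records = []
    for r in range(dt.Rows.Count):
        values = dt.Rows[r].ItemArray
        records.append({names[c]: values[c] for c in range(len(names))})
    return records
```

With the tutorial code above, it could be called as `records = datatable_to_dicts(ds.Tables[0])`.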
EasyXLS on Linux, Mac, Windows using Java with Python
If you opt for the Java version of EasyXLS, code similar to the above requires Py4J, Pyjnius or another bridge between Python and Java.
Step 1: Download and install EasyXLS
To download the trial version of the EasyXLS Excel Library, use the download button on the EasyXLS website.
If you already own a license key, you may log in and download EasyXLS from your account.
For v8.6 or earlier, run the downloaded EasyXLS installer.
Step 2: License file setup
This step is required for EasyXLS v9.0 or later.
If you are using a trial, generate a trial license file from the EasyXLS trials page. The trial license is valid for 30 days.
If you own a license key, you may log in to the account that purchased the license and generate the license file from: https://www.easyxls.com/my-orders
Set up the license file in your project by following the EasyXLS licensing guidelines.
Step 3: Install Py4j
To install Py4j, run the "pip" command as follows. Pip is a package-management system used to install and manage software packages written in Python.
<Python installation path>\Scripts>pip install "py4j.whl"
Step 4: Create additional Java program
The following Java code needs to be running in the background prior to executing the Python code.
import py4j.GatewayServer;

public class GatewayServerApp {
    public static void main(String[] args) {
        GatewayServerApp app = new GatewayServerApp();
        // app is now the gateway.entry_point
        GatewayServer server = new GatewayServer(app);
        server.start();
    }
}
Step 5: Add py4j library to CLASSPATH
py4j.jar must be added to the classpath of the additional Java program. After installing Py4j, py4j.jar can be found in the "<Python installation path>\share\py4j" folder.
Step 6: Add EasyXLS library to CLASSPATH
EasyXLS.jar must be added to the classpath of the additional Java program. EasyXLS.jar can be found:
- Inside the archive downloaded at Step 1, for EasyXLS v9.0 or later
- Under the installation path, in the "Lib" folder, for EasyXLS v8.6 or earlier
Step 7: Run additional Java program
Start the gateway server application. This implicitly starts the Java Virtual Machine as well.
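Before running the Python client, it can be useful to verify that the gateway is accepting connections. The sketch below uses only the standard library and assumes the server listens on Py4J's default port 25333 on the local machine. The helper name `gateway_is_listening` is hypothetical.

```python
import socket


def gateway_is_listening(host="127.0.0.1", port=25333, timeout=2.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable
        return False


if __name__ == "__main__":
    if gateway_is_listening():
        print("Gateway server is reachable.")
    else:
        print("Gateway server is not reachable; start the Java program first.")
```

A successful connection only shows that something is listening on that port. It does not prove that the listener is the Py4J gateway.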
Step 8: Run Python code that imports data from Excel to ResultSet
Execute Python code like the following, which imports Excel data to ResultSet.
"""------------------------------------------------------------
Tutorial 34
This tutorial shows how to import Excel to ResultSet in Python.
The data is imported from the active sheet of the Excel file
(the Excel file generated in Tutorial 09).
------------------------------------------------------------"""
import gc

from py4j.java_gateway import JavaGateway
from py4j.java_gateway import java_import

gateway = JavaGateway()
java_import(gateway.jvm, 'EasyXLS.*')
java_import(gateway.jvm, 'java.io.FileInputStream')

print("Tutorial 34\n-----------\n")

# Create an instance of the class that imports Excel files
workbook = gateway.jvm.ExcelDocument()

# Import Excel file to ResultSet
print("Reading file C:\\Samples\\Tutorial09.xlsx.\n")
file = gateway.jvm.FileInputStream("C:\\Samples\\Tutorial09.xlsx")
rs = workbook.easy_ReadXLSXActiveSheet_AsResultSet(file)

# Display imported ResultSet values
columnCount = rs.getMetaData().getColumnCount()
row = 0
while rs.next():
    for column in range(columnCount):
        # ResultSet columns are 1-based; str() guards against null values
        print("At row " + str(row + 1) + ", column " + str(column + 1) +
              " the value is '" + str(rs.getString(column + 1)) + "'")
    row = row + 1

# Dispose memory
gc.collect()
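To keep the imported values instead of printing them, the ResultSet can be read into a list of lists. The following is a minimal sketch that uses only the ResultSet calls already shown in the tutorial code (`next()`, `getMetaData().getColumnCount()` and `getString()`). The helper name `resultset_to_rows` is hypothetical and not part of EasyXLS or Py4J.

```python
def resultset_to_rows(rs):
    """Read every remaining row of a JDBC-style ResultSet into a list of lists."""
    column_count = rs.getMetaData().getColumnCount()
    rows = []
    while rs.next():
        # ResultSet column indexes start at 1, not 0
        rows.append([rs.getString(c) for c in range(1, column_count + 1)])
    return rows
```

A ResultSet is consumed as it is read, so this helper should be called in place of the printing loop rather than after it.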