Read Excel File
Reads the contents of an Excel file
Processing
For each input row, the step reads the entire content of a workbook. The resulting dict value is available as results.workbook
Settings
Name | Type | Description |
---|---|---|
Workbook | ||
Input | ||
Workbook |
string |
The path to the file to read. Relative paths are interpreted as relative to the flow file. Evaluated for each input row |
Sheets |
The sheets to extract from the file. If the value is Otherwise the following types are supported: List If the list is empty, all sheets are extracted. Otherwise each sheet in the workbook is matched against each list item in order to decide whether to extract it. If any list entry matches the sheet, it is extracted. List entries can take the following form:
String A string is interpreted as the name of a sheet to extract - only the sheet of the given name is extracted, if present. Function Functions of the form Example function that indicates to import only sheets whose name starts with ‘data_’ and which are not hidden in the workbook.
Boolean
Evaluated for each input row |
|
Output | ||
Sheets |
Indicates the structure used to represent sheets in the result. as dict - sheets are returned as a dict indexed by name i.e.
as list - sheets are returned as a list in order of appearance in the workbook i.e.
Evaluated for each input row |
|
Cells |
Defines how cells are represented in the result.
Temporal Values Excel saves date and time data internally as an anchored numeric value. The cell format makes it so that this number appears as a date or time value in Excel. Therefore the Excel value type of temporal values is ‘NUMERIC’. The step detects temporal values and generates a proper datetime as the cell’s value. Note Excel saves data in a sparse table format, and empty cells that have never been edited or formatted are often technically missing from Excel workbook files. Cells that are not technically present in the workbook are always represented as Evaluated for each input row |
|
Data | ||
Read Data Rows |
If checked, data rows are included in the result. Evaluated for each input row |
|
Read Text Boxes |
If checked, text boxes data is included in the result. Evaluated for each input row |
|
Read Images |
If checked, image data is included in the result. Evaluated for each input row |
|
Timezone |
Timezone used for extracted datetimes. Evaluated for each input row |
|
Misc | ||
Password |
Password used to read password-protected xls files. Evaluated for each input row |
Results
Name | Type | Description |
---|---|---|
workbook |
string |
The complete contents of the Excel file as a dict. It has the following structure:
|