Clean Your Data with Data Interpreter
Quote from Reddy on March 9, 2017, 11:46 pm
Sometimes, the format of your Google Sheets or Microsoft Excel data makes it difficult to analyze in Tableau. For example, your data might include additional tables, sub-tables, hierarchical headers, extraneous headers and footers, or empty rows and columns. Data Interpreter detects these sub-tables so that you can work with a subset of your data independently of the other data. It also removes the extraneous information to help prepare your data source for analysis.
After you set up the data source, Tableau prompts you to use Data Interpreter if it detects sub-tables, unique formatting, or extraneous information in the data.
Note: When you clean your data with Data Interpreter, Data Interpreter cleans all the data associated with a connection in the data source. Data Interpreter does not change the underlying data.
Turn on Data Interpreter and review results
After you have connected to your data and set up your data source, on the data source page, select the Clean with Data Interpreter check box.
Click Review data.
A copy of your data source opens in Excel, with the Key for the Data Interpreter tab selected.
Review the annotation key to find out how to read the results.
Click the subsequent tabs to review how Data Interpreter interpreted the data source.
You can also review Data Interpreter results directly in the grid below Data Interpreter.
If Data Interpreter does not provide the expected results, you can clear the Clean with Data Interpreter check box to use the original data source.
If Data Interpreter detects additional tables in your data, you can replace the current table with the found table by dragging it to the canvas.
If Data Interpreter has misidentified the range of the found table, click the table drop-down arrow on the canvas, and then select Edit Found Table to adjust the corners of the found table (the top-left cell and bottom-right cell of the table).
When Data Interpreter is not available
The Data Interpreter option might not be available for the following reasons:
- The data source is already in a format that Tableau can interpret: If Tableau Desktop doesn't need extra help from Data Interpreter to handle unique formatting or extraneous information, the Data Interpreter option is not available.
- Many rows or many columns: The Data Interpreter option is not available when your data has either of the following attributes:
  - Data contains more than 2000 columns.
  - Data contains more than 3000 rows and more than 150 columns.
- The data source is not supported: Data Interpreter is only available for Google Sheets and Excel data sources. Your Excel data must be in the XLS or XLSX format. Data in CSV format is not supported.
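The size and format limits above can be checked before you connect. The following is only a rough sketch, not part of Tableau: the function name `interpreter_blockers` and the constants are hypothetical, it assumes the two size conditions are alternatives (either one disables the option), and it assumes you already know the sheet's row and column counts. It covers Excel-style files only, and it cannot tell whether Tableau considers the data already interpretable.

```python
from pathlib import Path

# Limits quoted in the list above; the constant names are hypothetical.
MAX_COLUMNS = 2000
WIDE_ROW_LIMIT = 3000
WIDE_COLUMN_LIMIT = 150
SUPPORTED_EXTENSIONS = {".xls", ".xlsx"}


def interpreter_blockers(filename, n_rows, n_columns):
    """Return the stated limits that a sheet exceeds (empty list if none)."""
    reasons = []
    # Only XLS and XLSX Excel files are listed as supported; CSV is not.
    if Path(filename).suffix.lower() not in SUPPORTED_EXTENSIONS:
        reasons.append("unsupported file format")
    # Assumption: either size condition alone is enough to disable the option.
    if n_columns > MAX_COLUMNS:
        reasons.append("more than 2000 columns")
    if n_rows > WIDE_ROW_LIMIT and n_columns > WIDE_COLUMN_LIMIT:
        reasons.append("more than 3000 rows and more than 150 columns")
    return reasons


if __name__ == "__main__":
    for name, rows, cols in [("sales.xlsx", 500, 40), ("export.csv", 5000, 200)]:
        print(name, interpreter_blockers(name, rows, cols))
```

An empty list means none of the stated limits apply, although the option can still be unavailable if Tableau decides the data needs no interpretation.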