Working with a Data Extract
1. Extract Data
>> Concept
An extract is a copy of the data that brought into the Tableau data engine.
>> Advantages
- Use the Tableau data engine to run queries rather than send a query to the data source
- Reduce the time it takes for queries to run
 
- Allow you to keep a copy of the data:
- Can be accessed offline
 
- May include only a subset of the data
>> Limitations
- Data doesn’t update automatically
- Need to refresh the Extract
- A Live connection queries the data from the database and the data are updated every time you open your workbook
 
- Your extracted data source may not include all the fields required for the views
- Because extracts may include only a subset of the data
 
2. Create and edit Extracts
>> Two places to create extracts
- 
From the Data Source page   
 
- 
On the worksheet - [Data] Pane --> right-click the data source --> [Use Extract] - Switch between a live or extracted data connection: check / uncheck [Use Extract]     
>> Two places to edit Extracts
- 
From the data source page   
 
- 
On the worksheet - [Data] Pane --> right-click the data source --> [Extract Data]   
 [Edit extracts from the Extract Data dialog box]   
>> Update Data
- 
[Data] Pane --> right-click the data source --> [Extract] --> [Refresh]   
 
>> Hide / Unhide fields
- 
Hide unused fields   
 
- 
Unhide fields - 
[Data] Pane --> right-click --> [Show Hidden Fields]    
- 
Field to show --> right-click --> check [Unhide]   
 
- 
3. Refresh Extracts
>> Two Types:
- Full extract refresh (default) [完全刷新]
- all of the rows are replaced with the data in the original data source
- [good] ensures that you have an exact copy of what is in the original data
- [bad] can sometimes take a long time and be expensive on the database
 
- Incremental extract refresh [增量刷新]
- configure a refresh to add only the rows that are new since the previous time you extracted the data
- Note: If the data structure of the source data changes (for example, a new column is added), you will need to do a full extract refresh before you can start doing incremental refreshes again.
 
>> Full Refresh (Default)
- 
[Data] Pane --> right-click the data source --> [Extract] --> [Refresh]   
>> Update the data extract
Overwrite the existing extract by creating a new extract after the data extract edit (e.g. unhide fields)
- 
[Data] Pane --> right-click --> [Extract Data] 
- 
[Number of Rows] --> check [All rows] --> check [Incremental refresh] 
- 
Click [Extract]      

