Tableau is a business intelligence software that allows anyone to connect to respective data, and then visualize and create interactive, sharable dashboards.
Tableau Desktop is a data analysis visualization tool, 10-100 times faster than other analytic tools. Built on groundbreaking technology, Tableau desktop converts imaged data into optimized data queries and users can create interactive dashboards.
The new features introduced in Tableau 9.1 includes:
Data: With new web data connector, it makes data accessible from anywhere.
Mobile: The new tableau comes with a high-resolution thumbnails, taking screenshot offline and high-level security for the data.
Enterprise: Easy enterprise deployment with active directory synchronization.
Visual Analytics: View proximity in the radial selection tool, also provides features like creating filter formulas and Zoom control on your data.
With instantaneous and powerful data visualization techniques, Tableau Software easily connects to any data source (Corporate Data Warehouse, Microsoft Excel or other). It supports quicker drop-and-down interface processes for creating visual reports of massive amounts of data. The best part is that all data in Tableau from various sources is created equally irrespective of the data type.
Tableau public is a free service that allow anyone to publish interactive data to the web. Once it is there on web, anyone can interact with the data, download it or create their own visualization.
Yes, one can create relational joins in tableau without creating a new table.
A much advanced, direct, precise and ordered way of viewing large volumes of data is called data visualization. It is the visual representation of data in the form of graphs and charts, especially when you can’t define it textually. You can show trends, patters and correlations through various data visualization software and tools; Tableau is one such data visualization software used by businesses and corporates.
Different Tableau files include :
While Tableau desktop performs data visualization and workbook creation, Tableau server is used to distribute these interactive workbooks and/or reports to the right audience. Users can edit and update the workbooks and dashboards online or Server but cannot create new ones. However, there are limited editing options when compared to desktop.
Tableau Public is again a free tool consisting of Desktop and Server components accessible to anyone.
The difference between published data source and embedded data source is that:
Published data source: It contains connection information that is independent of any workbook and can be used by multiple workbooks.
Embedded data source: It contains connection information and is associated with a workbook.
Data Modeling in Tableau allows rapid analysis of operational data that produce visual prints of data to help users identify consumer trends, sales targets and operational improvement. Data modeling processes minimize the use of manual changes in the data structure but promote automatic adaption to the changing environment.
It also allows real-time modeling by letting users share their analysis in the collaborative working environment.
The data source are distinguished on the basis of following characteristics
Tableau parameters are dynamic variables/values that replace the constant values in data calculations and filters. For instance, you can create a calculated field value returning true when the score is greater than 80, and otherwise false. Using parameters, one can replace the constant value of 80 and control it dynamically in the formula.
If data resides in a single source, it is always desirable to use Joins. When your data is not in one place blending is the most viable way to create a left join like the connection between your primary and secondary data sources.
The difference actually lies in the application. Parameters allow users to insert their values, which can be integers, float, date, string that can be used in calculations. However, filters receive only values users choose to ‘filter by’ the list, which cannot be used to perform calculations.
Users can dynamically change measures and dimensions in parameter but filters do not approve of this feature.
A table data extract is a compressed snapshot of data stored on disk and loaded into memory as required to render a Tableau. A TDE is a columnar store and reduce the input/output required to access and aggregate the values.
Facts are the numeric metrics or measurable quantities of the data, which can be analyzed by dimension table. Facts are stores in Fact table that contain foreign keys referring uniquely to the associated dimension tables. The fact table supports data storage at atomic level and thus, allows more number of records to be inserted at one time. For instance, a Sales Fact table can have product key, customer key, promotion key, items sold, referring to a specific event.
Dimensions are the descriptive attribute values for multiple dimensions of each attribute, defining multiple characteristics. A dimension table ,having reference of a product key form the fact table, can consist of product name, product type, size, color, description, etc.
Global quick filters are a way to filter each worksheet on a dashboard until each of them contains a dimension. They are very useful for worksheets using the same data source, which sometimes proves to a disadvantage and generate slow results. Thus, parameters are more useful.
Parameters facilitate only four ways to represent data on a dashboard (which are seven in quick filters). Further, parameters do not allow multiple selections in a filter.
Aggregation and disaggregation in Tableau are the ways to develop a scatterplot to compare and measure data values. As the name suggests, aggregation is the calculated form of a set of values that return a single numeric value. For instance, a measure with values 1,3,5,7 returns 1. You can also set a default aggregation for any measure, which is not user-defined. Tableau supports various default aggregations for a measure like Sum, average, Median, Count and others.
Disaggregating data refers to viewing each data source row, while analyzing data both independently and dependently.
Unlike Data Joining, Data Blending in tableau allows combining of data from different sources and platforms. For instance, you can blend data present in an Excel file with that of an Oracle DB to create a new dataset.
The concept of context filter in Tableau makes the process of filtering smooth and straightforward. It establishes a filtering hierarchy where all other filters present refer to the context filter for their subsequent operations. The other filters now process data that has been passed through the context filter.
Creating one or more context filters improves performance as users do not have to create extra filters on large data source, reducing the query-execution time.
You can create by dragging a filed into ‘Filters’ tab and then, Right-Click that field and select ‘’Add to Context”
Tableau takes time to place a filter in context. When a filter is set as context one, the software creates a temporary table for that particular context filter. This table will reload each time and consists of all values that are not filtered by either Context or Custom SQL filter.
There are a number of file types and extensions in Tableau:
.twb is the most common file extension used in Tableau, which presents an XML format file and comprises all the information present in each dashboard and sheet like what fields are used in the views, styles and formatting applied to a sheet and dashboard.
But this workbook does not contain any data. The Packaged workbook merges the information in a Tableau workbook with the local data available (which is not on server). .twbx serves as a zip file, which will include custom images if any. Packaged Workbook allows users to share their workbook information with other Tableau Desktop users and let them open it in Tableau Reader.
Data extracts are the first copies or subdivisions of the actual data from original data sources. The workbooks using data extracts instead of those using live DB connections are faster since the extracted data is imported in Tableau Engine.
After this extraction of data, users can publish the workbook, which also publishes the extracts in Tableau Server. However, the workbook and extracts won’t refresh unless users apply a scheduled refresh on the extract. Scheduled Refreshes are the scheduling tasks set for data extract refresh so that they get refreshed automatically while publishing a workbook with data extract. This also removes the burden of republishing the workbook every time the concerned data gets updated.
Viewing underlying SQL Queries in Tableau provides two options:
Create a Performance Recording to record performance information about the main events you interact with workbook. Users can view the performance metrics in a workbook created by Tableau.
Help> Settings and Performance> Start Performance Recording
Help> Setting and Performance > Stop Performance Recording• Reviewing the Tableau Desktop Logs located at C:Users\My DocumentsMy Tableau Repository. For live connection to data source, you can check log.txt and tabprotosrv.txt files. For an extract, check tdeserver.txt file.
Tableau provides a distinct and powerful tool to control the output display known as Page shelf. As the name suggests, the page shelf fragments the view into a series of pages , presenting a different view on each page, making it more user-friendly and minimizing scrolling to analyze and view data and information. You can flip through the pages using the specified controls and compare them at a common axle.
Performance testing is again an important part of implementing Tableau. This can be done by loading Testing Tableau Server with TabJolt, which is a “Point and Run” load generator created to perform QA. While TabJolt is not supported by Tableau directly, it has to be installed using other open source products.
The maximum number of 32 tables can be joined in Tableau. A table size must also be limited to 255 columns (fields).
Create two sets, one for top 5 another for bottom 5 and the join these two sets displaying a unique set of total 10 rows.
TDE is a Tableau desktop file that contains with a .tde extension referring to the file containing data extracted from external sources like MS Excel, MS Access or CSV file.
By adding the same calculation to ‘Group By’ clause in SQL query or creating a Calculated Field in the Data Window and using that field whenever you want to group the fields.
Yes, parameters do have their independent dropdown lists enabling users to view the data entries available in the parameter during its creation.
Normal Filter is used to restrict the data from database based on selected dimension or measure. But Quick Filters are used to give a chance to user for dynamically changing data members at run time.
A syntax that supports aggregation at dimensionalities other than the view level. With level of detail expressions, you can attach one or more dimensions to any aggregate expression.
A pane that contains formatting settings that control the entire worksheet, as well as individual fields in the view.
A saved subset of a data source that you can use to improve performance and analyze offline.
A page where you can set up your data source. The Data Source page generally consists of four main areas: left pane, join area, preview area, and metadata area.
Tableau Reader is a free viewing application that lets anyone read and interact with packaged workbooks created by Tableau Desktop.
It is a text table view. Use text tables to display the numbers associated with dimension members.
A new field that you create by using a formula to modify the existing fields in your data source.
A pane on the left side of the workbook that displays the fields of the data sources to which Tableau is connected.
Show Me is used to apply a required view to the existing data in the worksheet. Those views can be a pie chart, scatter plot or a line chart.
It contains connection information that is independent of any workbook and can be used by multiple workbooks.
A story is a sheet that contains a sequence of worksheets or dashboards that work together to convey information
It refers to the file that contains data extracted from external sources like MS Excel, MS Access or CSV file.
Parameters are dynamic values that can replace constant values in calculations.
It Imports the entire data source into Tableau's fast data engine as an extract and saves it in the workbook.
It Creates a direct connect to the data source and speed up access.
Data blending is used to blend data from multiple data sources on a single worksheet. The data is joined on common dimensions.
These are inbuilt calculations in tableau which we normally use to calculate Percentange chages.
More Extracts, filters and depends on data sources.
The Pages shelf is used to control the display of output by choosing the sequence of display.
You can compare multiple measures using dual axes, which are two independent axes that are layered on top of each other.
In a context filter the filter condition is applied first to the data source and then some other filters are applied only to the resulting records.
An alternative name that you can assign to a field or to a dimension member.
A sheet where you build views of your data by dragging fields onto shelves.
They are Named areas to the left and top of the view. You build views by placing fields onto the shelves. Some shelves are available only when you select certain mark types.
A card to the left of the view where you can drag fields to control mark properties such as type, color, size, shape, label, tooltip, and detail.
Custom Filters ,Context Filters, Normal Filters.
It is a file with a .twb extension that contains one or more worksheets (and possibly also dashboards and stories).