[DOC] Pivot Table by ajdapretnar · Pull Request #3825 · biolab/orange3

ajdapretnar · 2019-05-28T13:22:28Z

Issue

Pivot Table needs docs.
#3823

Description of changes

Add documentation for the widget.

Includes

Code changes
Tests
Documentation

codecov · 2019-05-28T13:32:48Z

Codecov Report

Merging #3825 into master will increase coverage by 0.16%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #3825      +/-   ##
==========================================
+ Coverage   84.76%   84.93%   +0.16%     
==========================================
  Files         374      376       +2     
  Lines       69172    70136     +964     
==========================================
+ Hits        58637    59571     +934     
- Misses      10535    10565      +30

janezd · 2019-05-31T11:22:03Z

+
+**Outputs**
+
+- Pivot Table: contingency matrix as set in the widget


... as shown? as seen?

janezd · 2019-05-31T11:22:36Z

+
+- Pivot Table: contingency matrix as set in the widget
+- Filtered Data: subset selected from the plot
+- Grouped Data: data table grouped by row values


Maybe - Grouped Data: aggregates over groups defined by row values?

janezd · 2019-05-31T11:23:29Z

+- Filtered Data: subset selected from the plot
+- Grouped Data: data table grouped by row values
+
+**Pivot Table** summarizes the data of a more extensive table into a table of statistics. The statistics can include sums, averages, counts, etc. The widget also allows selecting a subset from the plot and grouping by row values, which have to be a discrete variable. Data with only numeric variables cannot be displayed in the plot.


plot? Probably table? (This appears twice.)

janezd · 2019-05-31T11:24:17Z

+
+![](images/Pivot-stamped.png)
+
+1. Discrete or numeric variable that will be used for row values. Numeric variables are considered as integers in this case. Variable values will appear as rows in the table.


The last sentence is perhaps redundant. You can also remove "in this case", and perhaps "that will be".

janezd · 2019-05-31T11:24:53Z

+
+1. Discrete or numeric variable that will be used for row values. Numeric variables are considered as integers in this case. Variable values will appear as rows in the table.
+2. Discrete variable that will be used for column values. Variable values will appear as columns in the table.
+3. Values that will be used for aggregation. Aggregated values will appear as cells in the table.


Consider removing "that will be" here and in the item above.

janezd · 2019-05-31T11:26:42Z

+3. Values that will be used for aggregation. Aggregated values will appear as cells in the table.
+4. Aggregation methods:
+   - For any variable type:
+      - *Count*: number of instances that appear in the data


- *Count*: size of the group, that is, the number of instances with the given row and column value.

I'm not totally sure this is better, though. :)

janezd · 2019-05-31T11:27:42Z

+4. Aggregation methods:
+   - For any variable type:
+      - *Count*: number of instances that appear in the data
+      - *Count defined*: number of non-empty (not NaN) instances in the data.


Huh, maybe "number of instances with this combination of the row and column value, for which the value that is used for aggregation is defined".
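
To make the difference between *Count* and *Count defined* concrete, here is a minimal sketch in pandas. It is not the widget's implementation; the toy table and the column names `row`, `col`, and `value` are hypothetical and only illustrate the two definitions proposed in the comments above.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: "row" and "col" play the role of the row and
# column variables, "value" is the variable used for aggregation.
df = pd.DataFrame({
    "row": ["a", "a", "a", "b", "b"],
    "col": ["x", "x", "y", "x", "x"],
    "value": [1.0, np.nan, 3.0, np.nan, np.nan],
})

grouped = df.groupby(["row", "col"])["value"]

# Count: size of each group, regardless of missing values.
count = grouped.size()

# Count defined: instances in each group whose aggregated value is not NaN.
count_defined = grouped.count()

print(pd.DataFrame({"count": count, "count_defined": count_defined}))
```

The group (a, x) has two instances but only one defined value, and (b, x) has two instances and no defined values, which is the case where the two statistics differ the most.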

janezd · 2019-05-31T11:29:20Z

+
+![](images/Pivot-discrete.png)
+
+Example of a pivot table with only discrete variables selected. We are using *heart-disease* data set for this example. We are using the values of *diameter narrowing* as row values, namely 0 and 1. Our columns are values of *gender*, namely female and male. We are using *thal* as values in our cells.


We are using the values of *diameter narrowing* as row values -> Rows correspond to values of the *diameter narrowing* variable.

You can skip "namely".

janezd · 2019-05-31T11:29:41Z

+
+![](images/Pivot-continuous.png)
+
+Example of a pivot table with numeric variables. We are using *heart-disease* data set for this example. We are using the values of *diameter narrowing* as row values, namely 0 and 1. Our columns are values of *gender*, namely female and male. We are using *rest SBP* as values in our cells.


Same as above.

janezd · 2019-05-31T11:30:40Z

+Example
+-------
+
+We are using *Forest Fires* for this example. The data is loaded in the [Datasets](../data/datasets.md) widget and passed to **Pivot Table**. *Forest Fires* datasets reports forest fires by the month and day they happened. We can aggregate all occurrences of forest fires by selecting *Count* as aggregation method and using *month* as row and *day* as column values. Since we are using *Count*, it does not matter what our *Values* variable will be, so we will leave it as is.


it does not matter what our *Values* variable will be -> maybe *Values* is unimportant (or something similar)
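
As a rough illustration of why *Values* is unimportant when counting, here is a small pandas sketch. The data frame is a hypothetical stand-in, not the actual *Forest Fires* data, and the column names `month`, `day`, and `area` are assumptions made for the example.

```python
import pandas as pd

# Hypothetical stand-in for a fires table; the real data set is not used here.
fires = pd.DataFrame({
    "month": ["aug", "aug", "sep", "sep", "sep", "mar"],
    "day": ["fri", "sun", "fri", "fri", "sat", "sun"],
    "area": [0.0, 1.2, 0.0, 5.4, 0.3, 0.0],
})

# Count of occurrences with month as rows and day as columns.
# No value column is passed: counting depends only on the row and
# column variables, so the choice of "area" or any other column is irrelevant.
counts = pd.crosstab(fires["month"], fires["day"])

print(counts)
```

Every cell holds the number of rows with that month and day combination, so the cells sum to the number of rows in the table.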

ajdapretnar · 2019-05-31T11:48:30Z

Comments addressed as well as possible.

janezd assigned BlazZupan and janezd May 30, 2019

janezd reviewed May 31, 2019

ajdapretnar added 2 commits May 31, 2019 13:46

Docs for Pivot Table

fed5280

Images for Pivot

a77a05d

ajdapretnar force-pushed the pivot-docs branch from d2b4787 to a77a05d Compare May 31, 2019 11:46

janezd merged commit d48b41f into biolab:master May 31, 2019

