Sfoglia il codice sorgente

📚 Add text to readme for each section

pull/8/head
Garrick Aden-Buie 7 anni fa
parent
commit
a6dfac1c60
2 ha cambiato i file con 74 aggiunte e 11 eliminazioni
  1. +33
    -5
      README.Rmd
  2. +41
    -6
      README.md

+ 33
- 5
README.Rmd Vedi File



[gganimate]: https://github.com/thomasp85/gganimate#README [gganimate]: https://github.com/thomasp85/gganimate#README
[dplyr-two-table]: https://dplyr.tidyverse.org/articles/two-table.html [dplyr-two-table]: https://dplyr.tidyverse.org/articles/two-table.html
[r4ds-set-ops]: http://r4ds.had.co.nz/relation-data.html#set-operations
[r4ds]: http://r4ds.had.co.nz/
[r4ds-relational]: http://r4ds.had.co.nz/relational-data.html
[r4ds-set-ops]: http://r4ds.had.co.nz/relational-data.html#set-operations
[r4ds-tidy-data]: http://r4ds.had.co.nz/tidy-data.html#tidy-data-1
[tidyverse]: https://tidyverse.org
[tidyr]: https://tidyr.tidyverse.org


# Tidy Animated Verbs # Tidy Animated Verbs




### Relational Data ### Relational Data


The [Relational Data](http://r4ds.had.co.nz/relation-data.html) chapter of the
[R for Data Science](http://r4ds.had.co.nz/) book by Garrett Grolemund and Hadley Wickham
The [Relational Data][r4ds-relational] chapter of the
[R for Data Science][r4ds] book by Garrett Grolemund and Hadley Wickham
is an excellent resource for learning more about relational data. is an excellent resource for learning more about relational data.


The [dplyr two-table verbs vignette][dplyr-two-table] The [dplyr two-table verbs vignette][dplyr-two-table]


## Mutating Joins ## Mutating Joins


> A mutating join allows you to combine variables from two tables. It first matches observations by their keys, then copies across variables from one table to the other.
> [R for Data Science: Mutating joins](http://r4ds.had.co.nz/relational-data.html#mutating-joins)

```{r intial-dfs} ```{r intial-dfs}
source("R/00_base_join.R") source("R/00_base_join.R")
df_names <- data_frame( df_names <- data_frame(


## Filtering Joins ## Filtering Joins


> Filtering joins match observations in the same way as mutating joins, but affect the observations, not the variables.
> ... Semi-joins are useful for matching filtered summary tables back to the original rows.
> ... Anti-joins are useful for diagnosing join mismatches.
> [R for Data Science: Filtering Joins](http://r4ds.had.co.nz/relational-data.html#filtering-joins)

### Semi Join ### Semi Join


> All rows from `x` where there are matching values in `y`, keeping just columns from `x`. > All rows from `x` where there are matching values in `y`, keeping just columns from `x`.


## Set Operations ## Set Operations


> Set operations are occasionally useful when you want to break a single complex filter into simpler pieces.
> All these operations work with a complete row, comparing the values of every variable.
> These expect the x and y inputs to have the same variables, and treat the observations like sets.
> [R for Data Science: Set operations](http://r4ds.had.co.nz/relational-data.html#set-operations)

```{r intial-dfs-so} ```{r intial-dfs-so}
source("R/00_base_set.R") source("R/00_base_set.R")
df_names <- data_frame( df_names <- data_frame(


## Tidy Data ## Tidy Data


[Tidy data][r4ds-tidy-data] follows the following three rules:

1. Each variable has its own column.
1. Each observation has its own row.
1. Each value has its own cell.

Many of the tools in the [tidyverse] expect data to be formatted as a tidy dataset and the [tidyr] package provides functions to help you organize your data into tidy data.

```{r tidyr-wide-long} ```{r tidyr-wide-long}
source("R/tidyr_spread_gather.R") source("R/tidyr_spread_gather.R")




`spread(data, key, value)` `spread(data, key, value)`


> Spread a key-value pair across multiple columns.
> Spread a key-value pair across multiple columns.
> Use it when an a column contains observations from multiple variables.


`gather(data, key = "key", value = "value", ...)` `gather(data, key = "key", value = "value", ...)`


> Gather takes multiple columns and collapses into key-value pairs, duplicating all other columns as needed. You use `gather()` when you notice that you have columns that are not variables.
> Gather takes multiple columns and collapses into key-value pairs, duplicating all other columns as needed.
> You use `gather()` when you notice that your column names are not names of variables, but *values* of a variable.


![](images/tidyr-spread-gather.gif) ![](images/tidyr-spread-gather.gif)



+ 41
- 6
README.md Vedi File



### Relational Data ### Relational Data


The [Relational Data](http://r4ds.had.co.nz/relation-data.html) chapter
of the [R for Data Science](http://r4ds.had.co.nz/) book by Garrett
Grolemund and Hadley Wickham is an excellent resource for learning more
about relational data.
The [Relational Data](http://r4ds.had.co.nz/relational-data.html)
chapter of the [R for Data Science](http://r4ds.had.co.nz/) book by
Garrett Grolemund and Hadley Wickham is an excellent resource for
learning more about relational data.


The [dplyr two-table verbs The [dplyr two-table verbs
vignette](https://dplyr.tidyverse.org/articles/two-table.html) and Jenny vignette](https://dplyr.tidyverse.org/articles/two-table.html) and Jenny


## Mutating Joins ## Mutating Joins


> A mutating join allows you to combine variables from two tables. It
> first matches observations by their keys, then copies across variables
> from one table to the other.
> [R for Data Science: Mutating
> joins](http://r4ds.had.co.nz/relational-data.html#mutating-joins)

<img src="images/static/png/original-dfs.png" width="480px" /> <img src="images/static/png/original-dfs.png" width="480px" />


``` r ``` r


## Filtering Joins ## Filtering Joins


> Filtering joins match observations in the same way as mutating joins,
> but affect the observations, not the variables. … Semi-joins are
> useful for matching filtered summary tables back to the original rows.
> … Anti-joins are useful for diagnosing join mismatches.
> [R for Data Science: Filtering
> Joins](http://r4ds.had.co.nz/relational-data.html#filtering-joins)

### Semi Join ### Semi Join


> All rows from `x` where there are matching values in `y`, keeping just > All rows from `x` where there are matching values in `y`, keeping just


## Set Operations ## Set Operations


> Set operations are occasionally useful when you want to break a single
> complex filter into simpler pieces. All these operations work with a
> complete row, comparing the values of every variable. These expect the
> x and y inputs to have the same variables, and treat the observations
> like sets.
> [R for Data Science: Set
> operations](http://r4ds.had.co.nz/relational-data.html#set-operations)

<img src="images/static/png/original-dfs-set-ops.png" width="480px" /> <img src="images/static/png/original-dfs-set-ops.png" width="480px" />


``` r ``` r


## Tidy Data ## Tidy Data


[Tidy data](http://r4ds.had.co.nz/tidy-data.html#tidy-data-1) follows
the following three rules:

1. Each variable has its own column.
2. Each observation has its own row.
3. Each value has its own cell.

Many of the tools in the [tidyverse](https://tidyverse.org) expect data
to be formatted as a tidy dataset and the
[tidyr](https://tidyr.tidyverse.org) package provides functions to help
you organize your data into tidy data.

![](images/static/png/original-dfs-tidy.png) ![](images/static/png/original-dfs-tidy.png)


``` r ``` r


`spread(data, key, value)` `spread(data, key, value)`


> Spread a key-value pair across multiple columns.
> Spread a key-value pair across multiple columns. Use it when an a
> column contains observations from multiple variables.


`gather(data, key = "key", value = "value", ...)` `gather(data, key = "key", value = "value", ...)`


> Gather takes multiple columns and collapses into key-value pairs, > Gather takes multiple columns and collapses into key-value pairs,
> duplicating all other columns as needed. You use `gather()` when you > duplicating all other columns as needed. You use `gather()` when you
> notice that you have columns that are not variables.
> notice that your column names are not names of variables, but *values*
> of a variable.


![](images/tidyr-spread-gather.gif) ![](images/tidyr-spread-gather.gif)



Loading…
Annulla
Salva