Vignettes

#### HTML vignette series:

**Planned for `v1.9.8`**
- [ ] Quick tour of data.table
- [x] [Keys and fast binary search based subset](https://rawgit.com/wiki/Rdatatable/data.table/vignettes/datatable-keys-fast-subset.html)
- [x] [Secondary indices and auto indexing](https://rawgit.com/wiki/Rdatatable/data.table/vignettes/datatable-secondary-indices-and-auto-indexing.html)
- [x] **Joins vignette**. a) _joins_ vs _subsets_ -- extending binary search based subset to joins + conditional / non-equi joins, rolling and interval joins. b)  by=.EACHI, join + update feature. c) Document `i.col` usage as filed in #1038. d) Also cover about performance/advantages from #1232. 
- ~~[ ] Cover `get()` and `mget()`. E.g., http://stackoverflow.com/q/33785747/559784~~ covered in #4304
- [ ] Add about on= argument rationale in FAQ (#1623).
- [ ] FAQ 5.3 needs to mention that it's a _shallow_ copy that's done in order to restore over-allocation. Thanks to Jan for linking it in #1729.

---

**Future releases**
- [ ] data.table internals, performance aspects and _expressiveness_
- [ ] Reading multiple files (`fread` + `rbindlist`), ordering, ranking and set operations
- [ ] IDateTime vignette
- [ ] Document the difference between `data.table()` and `data.frame()` somewhere - relevant issues: #968, #877. Perhaps slightly more in detail in the FAQ.
- [ ] coursera FAQ
- [ ] Advanced `data.table` usage: 
  - [ ] NSE
  - [ ] ...
- [ ] Timings vignette (moving #520 here to get everything in one place, but not sure if we need it as a vignette since we've the Wiki with benchmarks/timings).
- [ ] `fread+fwrite` vignette, include also [Convenience features of fread](https://github.com/Rdatatable/data.table/wiki/Convenience-features-of-fread) wiki, also https://github.com/Rdatatable/data.table/issues/2855

---

**Finished:**
- [x] [Introduction to data.table](https://rawgit.com/wiki/Rdatatable/data.table/vignettes/datatable-intro-vignette.html) - data.table syntax, general form, subset rows in `i`, select / do in `j` and aggregations using `by`.
- [x] [Reference Semantics](https://rawgit.com/wiki/Rdatatable/data.table/vignettes/datatable-reference-semantics.html) (_add/update/delete_ columns by reference, and see that we can combine with `i` and `by` in the same way as before)
- [x] [Efficient reshaping using data.tables](https://rawgit.com/wiki/Rdatatable/data.table/vignettes/datatable-reshape.html)
- [x] Link to [this answer on SO](http://stackoverflow.com/a/27004566/559784) on `by=.EACHI` until the vignette is done.

---
#### Minor:
- [ ] Operations using `integer64`, and promoting it for _large integers_.

---

Notes (to update current vignettes based on feedbacks): Please let me know if I missed anything..
#### Introduction to data.table:
- [x] `order` in `i`.
- [x] Explain how to name columns in `j` while selecting/computing.
- [x] Emphasise that _keyby_ is applied _after_ obtaining the result on the computed result, not on the original data.table.
- [x] Mention new updates to `.SDcols` and cols in `with=FALSE` being able to select columns as `colA:colB`.
#### Reference semantics:
- [ ] Also explain all other relevant `set*` functions here.. (`setnames`, `setcolorder` etc..)
- [ ] Mainly `set`.
- [x] Explain that `1b) the := operator` is just defining ways to use it - the example there doesn't work as it just shows two different ways of using it -- Following [this comment](http://stackoverflow.com/questions/29870673/how-to-unquote-string-in-r-to-access-column-in-data-frame?noredirect=1#comment47913441_29870673).
#### Keys and fast binary search based subsets:
- [ ] Add an example of subset using integer/double keys.
- [ ] Difference in "nomatch" default in binary search based subsets.
- [ ] replacing NAs with binary search based subsets possible?
#### FAQ (most appropriate here, I think).
- [x] Update FAQ with issue on external pointer being NULL when reading an R object from file, for example, using `readRDS()`. Update [this SO post](http://stackoverflow.com/q/29614386/559784).
- [ ] Explain with example, on over allocating the data.table using `alloc.col()`, and when to use it (when you need to create multiple columns), and why. Update [this SO post](http://stackoverflow.com/q/29615181/559784).


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vignettes #944

HTML vignette series:

Minor:

Introduction to data.table:

Reference semantics:

Keys and fast binary search based subsets:

FAQ (most appropriate here, I think).

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Vignettes #944

Description

HTML vignette series:

Minor:

Introduction to data.table:

Reference semantics:

Keys and fast binary search based subsets:

FAQ (most appropriate here, I think).

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions