- Breadth
- Depth
- Scale
- Target
Ahsan Ijaz
Tools | Abstraction |
---|---|
Hadoop | MapReduce |
PostgreSQL | Relational Algebra |
glm in R | Logistic regression |
Tableau | InfoVis |
Structures | Statistics |
---|---|
Management | Linear Algebra |
Relational Algebra | Analysis |
Standards | Ad hoc files |
Desktop | Cloud |
---|---|
Main memory | Distributed |
R | Hadoop |
Local files | S3, Azure... |
Hackers | Analysts |
---|---|
Assume proficiency in R, Python | Assume no programming knowledge |
Activities of users at terminals and most application programs should remain unaffected when the internal representation of data is changed.
Key idea: Programs that manipulate tabular data exhibit an algebraic structure that allows reasoning and manipulation independently of the physical data representation.
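A small sketch may make the key idea concrete. It models relations as lists of dictionaries and shows that filtering after a join and filtering before a join give the same rows, which is the kind of algebraic rewrite a query optimizer can apply without knowing how the data is stored. The table names, columns, and helper functions (`select`, `join`, `as_set`) are illustrative assumptions, not part of the original slides.

```python
# Hypothetical toy relations: each row is a dict, each relation a list of rows.
orders = [
    {"order_id": 1, "cust_id": 10, "total": 250},
    {"order_id": 2, "cust_id": 11, "total": 40},
    {"order_id": 3, "cust_id": 10, "total": 75},
]
customers = [
    {"cust_id": 10, "name": "Ada"},
    {"cust_id": 11, "name": "Grace"},
]


def select(rows, pred):
    """Relational selection: keep the rows that satisfy pred."""
    return [r for r in rows if pred(r)]


def join(left, right, key):
    """Equi-join on a shared column, using a hash index on the right side."""
    index = {}
    for r in right:
        index.setdefault(r[key], []).append(r)
    return [{**l, **r} for l in left for r in index.get(l[key], [])]


def as_set(rows):
    """Order-insensitive view of a relation, for comparing query plans."""
    return {tuple(sorted(r.items())) for r in rows}


def is_big(row):
    return row["total"] > 50


# Plan A: join first, then filter.
plan_a = select(join(orders, customers, "cust_id"), is_big)
# Plan B: filter first, then join (the selection is "pushed down").
plan_b = join(select(orders, is_big), customers, "cust_id")

# The two plans are algebraically equivalent, so the results match.
same = as_set(plan_a) == as_set(plan_b)
```

Plan B touches fewer rows during the join, yet the program's meaning is unchanged. The equivalence holds here because the predicate only refers to columns of `orders`.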
Convert all images from TIFF to PNG.
Run thousands of simulations.
Most frequent word in each document.
Histogram of words in each document.
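The tasks above share one shape: the same function is applied to every item independently. A minimal sketch of the two word-count tasks follows, assuming whitespace tokenization and lowercase folding. The sample documents and the function names are made up for illustration. A thread pool stands in for whatever parallel runtime would be used at scale.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sample documents, keyed by a document id.
documents = {
    "doc1": "the cat sat on the mat",
    "doc2": "to be or not to be that is to ask",
}


def word_histogram(text):
    """Count occurrences of each whitespace-separated, lowercased word."""
    return Counter(text.lower().split())


def most_frequent_word(text):
    """Return the most common word; raises IndexError on an empty document."""
    word, _count = word_histogram(text).most_common(1)[0]
    return word


def apply_to_each(fn, docs):
    """Apply fn to every document independently and keep the document ids."""
    ids = list(docs)
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(fn, (docs[i] for i in ids)))
    return dict(zip(ids, results))


histograms = apply_to_each(word_histogram, documents)
top_words = apply_to_each(most_frequent_word, documents)
```

Because no document's result depends on any other document, the work can be split across threads, processes, or machines without changing the per-document function.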