Notes on the Reading Assignments

Please do not repost or otherwise distribute the materials available from this website. Some material is available freely on the web, other is behind a paywall, other is private and we only have permission to use the material in class, not distribute.

  1. Due date: Wed. March 31. Submit here.

    A Case Against SQL

    Explain briefly three peculiar behaviors of SQL


  2. Due date: Wed. April 7. Submit here.

    What goes around

    Read sections 1-5 and 10. The other sections are not recommended and we will not discuss them in class.

    Some suggested topics for discussion in your review:


  3. Due date: Wed. April 14. Submit here.

    PAX Read sections 1-4; we will discuss them in class. Sec. 5-7 are optional and will not be covered in class.

    Column Store Read sections 1 and 2, skim over Sec. 3; we will read Sec 4 later (see below)

    Some suggested topics for discussion in your review:


  4. Due date: Wed. April 21. Submit here.

    Query Compiler Read sections 1,2,3; skim over section 4 (in particular, check out Fig. 6, which should become clear).

    Some topics to focus on (may use them in your review)

    Optional: Vectorized v.s. Compiled Sections 1 and 2 are a short introduction to vectorized v.s. data driven query processing.


  5. Due date: Wed. April 28. Submit here.

    How good are they?

    Read the entire paper. We will discuss most of it in class.


  6. Due date: Mon. May 3. Submit here.

    Column Store You have already started to read this paper. Now read Sections 4.1, 4.4., 4.5 (skim over the rest of Sec. 4).

    In your review:


  7. Due date: Wed. May 12. Submit here.

    The case for learned indexes Read sections 1 and 2 (motivation and main idea). Only skim over sec. 3. Be aware that the technique described in section 3 and up is an overkill: read the blog:

    Why use learning? (blog)

    Optional: for newer, and simpler techniques that use regression in index structures, see here.


  8. Due date: Mon. May 17. Submit here.

    LSM (watch this short video first)

    This is a very informative, but rather dense paper. Read and review sections 1,2,3. I recommend reading the rest of the paper too, but it is optional and we will not discuss it in class.

    Optional: watch the conference presentation of this paper here start at 52:00.


  9. Due date: Mon. May 24 Submit here.

    MapReduce Read sections 1,2,3.

    A major step backwards (blog)

    Hive Read sections 1, 2, and skim through section 4 (focus on the optimizations)

    Suggestions for discussion topics:


  10. Due date: Wed. May 26 Submit here.

    Snowflake

    Read sections 1,2,3, skim over 4, and read Sec. 6. Suggestions for topics to address in your review: