Semestr: Winter
Range: 2P+2C
Completion:
Credits: 6
Programme type:
Study form: Fulltime
Course language: English
The aim of the course is to introduce students to new trends in the field of database systems. We will focus primarily on current issues related to Big Data and the associated challenges of distributed data storage and processing. The course will present the basic types of so-called NoSQL databases, as well as related topics such as cloud computing, data warehouses, and distributed computing over large datasets.
Big data, NoSQL, MapReduce, Scaling, CAP theorem, Column database, Graph database, NewSQL, Cloud computing, Data warehouses, Data generation, Benchmarking
1. Introduction to Big Data and NoSQL databases
2. Comparison and classification of NoSQL databases
3. Fundamentals of Big Data management (CAP, distribution, scaling, replication)
4. Key-Value databases
5. Document databases
6. Wide-column databases
7. Graph databases – basic principles
8. Graph databases – advanced aspects
9. Advanced principles of Big Data (transactions, performance, polyglot persistence)
10. The principle of MapReduce
11. Data warehouses and Big Data
11. Cloud computing
13. Advanced principles of Big Data (transactions, performance, polyglot persistence)
14. Reserve
1. Introduction to Big Data and NoSQL databases
2. Comparison and classification of NoSQL databases
3. Data formats (XML, JSON)
4. Key-Value databases (Redis)
5. Key-Value databases (Redis, Redis Stack)
6. Document databases (MongoDB) – creating, updating, and deleting records
7. Document databases (MongoDB) – search queries
8. Document databases (MongoDB) – aggregation queries
9. Wide-column databases
10. Graph databases – basic CRUD operations
11. Graph databases – graph traversal and search
12. MapReduce and HDFS
13. Final project demonstration
14. Reserve
1. http://nosql-database.org/
2. Pramod J. Sadalage - Martin Fowler: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot
Persistence, 2012
3. Eric Redmond - Jim R. Wilson: Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL
Movement, 2012
4. Sherif Sakr - Eric Pardede: Graph Data Management: Techniques and Applications, 2011
5. Shashank Tiwari: Professional NoSQL, 2011
6. Hadoop: The Definitive Guide, by Tom White, 2nd edition, 2010
7. Martin Kleppmann. Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and
Maintainable Systems, 2017
8. Shannon Bradshaw, Eoin Brazil, Kristina Chodorow. Mongodb: The Definitive Guide: Powerful and Scalable Data,
2020
9. Jeff Carpenter, Eben Hewitt. Cassandra: The Definitive Guide. O’Reilly Media, 2016
10. Jeremy Nelson. Mastering Redis. Packt Publishing, 2016