CST363 Week 1 April 28th - May 6th
1. The key differences between relational tables and spreadsheets are structure and schema. In a spreadsheet you can mix data types within a column because no schema is enforced, whereas a relational table declares a data type for each column, so age would have to be an integer value. A spreadsheet also has no enforced type validation, whereas a relational table rejects data of the wrong type. The biggest difference is the ability to query. I've run queries on large datasets in Excel before; it takes forever and isn't maintainable at a larger scale, whereas with relational tables I can query and cross-reference specific data from different tables and get results quickly.
2. From what I've learned so far, investing in a database means data integrity and consistency enforced through schemas. From personal experience, I know that databases are often compatible with other tools such as Tableau, Power BI, and Excel, all of which are great for analyzing and visualizing data.
3. My learning goal for this course is to become familiar with SQL and data modeling. I want to go beyond simple SQL statements such as select * from table where x = 'y' to more complex statements, to prepare myself for real-world data that is often incredibly large and requires cross-referencing multiple databases to find a very specific dataset. I also want to know how to create my own database, host it, and integrate it with a pre-existing project of mine so I can see how data is stored, how I can retrieve it, how I can change it from the front end with some back-end code, and how I can visualize it.
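
To make points 1 and 3 a bit more concrete, here is a minimal sketch using Python's built-in sqlite3 module. Everything in it is made up for illustration: the person and city tables, their columns, and the sample rows are hypothetical. One assumption worth calling out is the CHECK constraint on age. SQLite is loosely typed by default, so I added the check to get the kind of rejection a stricter database would give for a non-integer age. The last function is the sort of query I mean by going beyond select *: it joins two tables and aggregates the result.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with two related tables (hypothetical names)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE city (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
    conn.execute(
        """
        CREATE TABLE person (
            id      INTEGER PRIMARY KEY,
            name    TEXT    NOT NULL,
            -- Assumption: the CHECK makes loosely typed SQLite reject non-integers.
            age     INTEGER NOT NULL CHECK (typeof(age) = 'integer'),
            city_id INTEGER NOT NULL REFERENCES city(id)
        )
        """
    )
    # Sample rows are invented purely for the demo.
    conn.executemany(
        "INSERT INTO city (id, name) VALUES (?, ?)",
        [(1, "Monterey"), (2, "Salinas")],
    )
    conn.executemany(
        "INSERT INTO person (name, age, city_id) VALUES (?, ?, ?)",
        [("Ana", 30, 1), ("Ben", 40, 1), ("Cy", 25, 2)],
    )
    conn.commit()
    return conn


def try_insert_bad_age(conn):
    """Return True if the database rejects a text value in the age column."""
    try:
        conn.execute(
            "INSERT INTO person (name, age, city_id) VALUES (?, ?, ?)",
            ("Dee", "thirty", 1),
        )
    except sqlite3.IntegrityError:
        return True
    return False


def average_age_by_city(conn):
    """Cross-reference person and city with a JOIN, then aggregate per city."""
    return conn.execute(
        """
        SELECT c.name, AVG(p.age)
        FROM person AS p
        JOIN city AS c ON c.id = p.city_id
        GROUP BY c.name
        ORDER BY c.name
        """
    ).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    print("bad age rejected:", try_insert_bad_age(conn))
    for city, avg_age in average_age_by_city(conn):
        print(city, avg_age)
```

In a spreadsheet, the "thirty" row would simply sit in the age column until a formula choked on it. Here the insert fails up front, and the join answers a cross-table question in a single statement.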