Database management system notes | Schemes and Mind Maps Database Management Systems (DBMS)

DATABASE DESIGN: NORMALIZATION NOTE & EXERCISES (Up to 3NF)

• Tables that contain redundant data can suffer from update anomalies, which can introduce inconsistencies into a database.

• These notes cover the rules associated with the most commonly used normal forms: first (1NF), second (2NF), and third (3NF).

• Tables that break the rules of 1NF, 2NF, and 3NF are likely to contain redundant data and to suffer from update anomalies, which fall into three types: insertion, deletion, and modification anomalies.

• Normalization is a technique for producing a set of tables with desirable properties that support the requirements of a user or company.

• A major aim of relational database design is to group columns into tables so as to minimize data redundancy and reduce the file storage space required by the base tables.

• Take a look at the following example:

StdSSN  StdCity  StdClass  OfferNo  OffTerm  OffYear  EnrGrade  CourseNo  CrsDesc
S1      SEATTLE  JUN       O1       FALL     2006     3.5       C1        DB
S1      SEATTLE  JUN       O2       FALL     2006     3.3       C2        VB
S2      BOTHELL  JUN       O3       SPRING   2007     3.1       C3        OO
S2      BOTHELL  JUN       O2       FALL     2006     3.4       C2        VB
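The redundancy in the example table can be made visible by counting how often the same fact is stored. The following is only a small Python sketch; the helper name count_repeats and the grouping of columns into "student facts" and "offering facts" are assumptions made for illustration, not part of the original notes.

```python
from collections import Counter

# Column order and rows copied from the example table.
COLUMNS = ["StdSSN", "StdCity", "StdClass", "OfferNo", "OffTerm",
           "OffYear", "EnrGrade", "CourseNo", "CrsDesc"]
ROWS = [
    ("S1", "SEATTLE", "JUN", "O1", "FALL", 2006, 3.5, "C1", "DB"),
    ("S1", "SEATTLE", "JUN", "O2", "FALL", 2006, 3.3, "C2", "VB"),
    ("S2", "BOTHELL", "JUN", "O3", "SPRING", 2007, 3.1, "C3", "OO"),
    ("S2", "BOTHELL", "JUN", "O2", "FALL", 2006, 3.4, "C2", "VB"),
]


def count_repeats(rows, column_names):
    """Count how many rows carry each combination of the given columns."""
    positions = [COLUMNS.index(name) for name in column_names]
    return Counter(tuple(row[i] for i in positions) for row in rows)


# Assumed grouping: columns that describe a student vs. an offering.
student_facts = count_repeats(ROWS, ["StdSSN", "StdCity", "StdClass"])
offering_facts = count_repeats(
    ROWS, ["OfferNo", "OffTerm", "OffYear", "CourseNo", "CrsDesc"]
)

for fact, times in student_facts.items():
    print(fact, "stored", times, "time(s)")
```

Any fact with a count above 1 is stored redundantly: here the city and class of each student appear twice, and so do the details of offering O2.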

• The insertion anomaly: occurs when extra data beyond the desired data must be added to the database. For example, to insert a course (CourseNo), it is necessary to know a student (StdSSN) and an offering (OfferNo), because the combination of StdSSN and OfferNo is the primary key. Remember that a row cannot exist with NULL values for part of its primary key.
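The insertion anomaly can be tried out with Python's built-in sqlite3 module. This is a sketch under assumptions: the table name BigEnrollment, the column types, and the course C4 are hypothetical and do not come from the notes. NOT NULL is written out on the key columns because SQLite does not enforce it automatically for a composite primary key in an ordinary table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table holding all nine columns of the example;
# the primary key is the combination of StdSSN and OfferNo.
conn.execute("""
    CREATE TABLE BigEnrollment (
        StdSSN   TEXT NOT NULL,
        StdCity  TEXT,
        StdClass TEXT,
        OfferNo  TEXT NOT NULL,
        OffTerm  TEXT,
        OffYear  INTEGER,
        EnrGrade REAL,
        CourseNo TEXT,
        CrsDesc  TEXT,
        PRIMARY KEY (StdSSN, OfferNo)
    )
""")


def try_insert_course_only(course_no, crs_desc):
    """Try to store a course without a student or an offering."""
    try:
        conn.execute(
            "INSERT INTO BigEnrollment (CourseNo, CrsDesc) VALUES (?, ?)",
            (course_no, crs_desc),
        )
        return True
    except sqlite3.IntegrityError:
        # The key columns would be NULL, so the row is rejected.
        return False


inserted = try_insert_course_only("C4", "hypothetical course")
print("course stored on its own:", inserted)
```

The insert is rejected, so the course can only be recorded once some student has enrolled in an offering of it.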

• The update anomaly: occurs when multiple rows must be changed to modify a single fact. For example, if we change the StdClass of student S1 (currently JUN), two rows (rows 1 and 2) must be changed. If S1 were enrolled in 10 classes, 10 rows would have to be changed.
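The update anomaly can be checked in the same way. In this sketch the table name BigEnrollment, the column types, and the new class value SEN are assumptions chosen for illustration; only the four rows come from the notes.

```python
import sqlite3

ROWS = [
    ("S1", "SEATTLE", "JUN", "O1", "FALL", 2006, 3.5, "C1", "DB"),
    ("S1", "SEATTLE", "JUN", "O2", "FALL", 2006, 3.3, "C2", "VB"),
    ("S2", "BOTHELL", "JUN", "O3", "SPRING", 2007, 3.1, "C3", "OO"),
    ("S2", "BOTHELL", "JUN", "O2", "FALL", 2006, 3.4, "C2", "VB"),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE BigEnrollment ("
    "StdSSN TEXT NOT NULL, StdCity TEXT, StdClass TEXT, "
    "OfferNo TEXT NOT NULL, OffTerm TEXT, OffYear INTEGER, "
    "EnrGrade REAL, CourseNo TEXT, CrsDesc TEXT, "
    "PRIMARY KEY (StdSSN, OfferNo))"
)
conn.executemany(
    "INSERT INTO BigEnrollment VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", ROWS
)

# One fact changes (the class of S1), yet every enrollment row of S1
# has to be rewritten. "SEN" is a hypothetical new class value.
cur = conn.execute(
    "UPDATE BigEnrollment SET StdClass = ? WHERE StdSSN = ?", ("SEN", "S1")
)
rows_changed = cur.rowcount

# If only one of the rows is changed, S1 ends up with two classes.
conn.execute(
    "UPDATE BigEnrollment SET StdClass = 'JUN' "
    "WHERE StdSSN = 'S1' AND OfferNo = 'O1'"
)
classes_of_s1 = sorted(
    row[0]
    for row in conn.execute(
        "SELECT DISTINCT StdClass FROM BigEnrollment WHERE StdSSN = 'S1'"
    )
)

print("rows changed:", rows_changed)
print("classes now stored for S1:", classes_of_s1)
```

The second update imitates a partial change: the table then holds two different classes for the same student, which is the kind of inconsistency that redundant data allows.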

• The deletion anomaly: occurs whenever deleting a row inadvertently causes other data to be deleted. For example, if we delete the enrollment (EnrGrade) of S2 in O3 (third row), we lose the information about offering O3 and course C3, because these values appear only in that row. Furthermore, O3 is a value of OfferNo, which is part of the primary key, so the offering cannot be kept in a row of its own without a student.
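The deletion anomaly can also be reproduced with a few lines of Python and sqlite3. As before, this is an illustrative sketch: the table name BigEnrollment, the column types, and the helper count_where are assumptions, while the rows are those of the example table.

```python
import sqlite3

ROWS = [
    ("S1", "SEATTLE", "JUN", "O1", "FALL", 2006, 3.5, "C1", "DB"),
    ("S1", "SEATTLE", "JUN", "O2", "FALL", 2006, 3.3, "C2", "VB"),
    ("S2", "BOTHELL", "JUN", "O3", "SPRING", 2007, 3.1, "C3", "OO"),
    ("S2", "BOTHELL", "JUN", "O2", "FALL", 2006, 3.4, "C2", "VB"),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE BigEnrollment ("
    "StdSSN TEXT NOT NULL, StdCity TEXT, StdClass TEXT, "
    "OfferNo TEXT NOT NULL, OffTerm TEXT, OffYear INTEGER, "
    "EnrGrade REAL, CourseNo TEXT, CrsDesc TEXT, "
    "PRIMARY KEY (StdSSN, OfferNo))"
)
conn.executemany(
    "INSERT INTO BigEnrollment VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", ROWS
)


def count_where(column, value):
    """Count rows where column equals value (column is a fixed, trusted name)."""
    sql = f"SELECT COUNT(*) FROM BigEnrollment WHERE {column} = ?"
    return conn.execute(sql, (value,)).fetchone()[0]


before = (count_where("OfferNo", "O3"), count_where("CourseNo", "C3"))

# Delete only the enrollment of S2 in O3 (the third row).
conn.execute(
    "DELETE FROM BigEnrollment WHERE StdSSN = 'S2' AND OfferNo = 'O3'"
)

after = (count_where("OfferNo", "O3"), count_where("CourseNo", "C3"))
s2_rows_left = count_where("StdSSN", "S2")

print("rows mentioning O3 / C3 before:", before)
print("rows mentioning O3 / C3 after: ", after)
print("rows left for S2:", s2_rows_left)
```

Student S2 is still in the table through the enrollment in O2, but nothing is left about offering O3 or course C3, although only one enrollment was meant to be removed.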

RECAP

Problems associated with data redundancy are illustrated by comparing the Staff and Branch tables with the StaffBranch table. Tables that have redundant data may have problems called update anomalies, which are classified as insertion, deletion, or modification anomalies. See the following Figure for an example of a table with redundant data called StaffBranch. There are two main types of insertion anomalies, which we illustrate using this table.
