Applied Science, Computing and Engineering
BSc (Hons) Computing
Module: COM636 Distributed Data
COM617: Assignment 1
The assignment will entail reading a range of academic papers, journals and books. You will be writing reports and designing Distributed Database Systems and Data Warehouse Systems.
Task 1: Distributed Database Design (60%)
Happy Cruise Lines is a cruise company with several ships and a variety of cruise itineraries, each involving a several ports of call. The company headquarters is in New York and has regional offices in the cruise port cities of Miami, Houston, and Los Angeles.
The company maintains information of ships, cruises, ports, passengers and voyage. CRUISE is a particular sailing of a ship on a particular date. VISITS records the ships visits to the ports during the cruise. VOYAGE records the passengers for a cruise. Given below is the snapshot of Happy Cruise Lines’ database which lists the attributes for each entity along with the primary key and foreign keys.
SHIPS: (SHIPNUM, SHIPNAME, BUILDER, LAUNCHDATE, WEIGHT)
PASSENGER: (PASSENGERNUM, PASSENGERNAME, SOCIALSECNUM, STATE, COUNTRY)
PORT: (PORTNUM, COUNTRY, NUMDOCKS, MANAGER)
CRUISE: (CRUISENUM, STARTDATE, ENDDATE, DIRECTOR, SHIPNUM)
VISIT: (CRUISENUM, PORTNUM, COUNTRY, ARRDATE, DEPTDATE)
VOYAGE: (PASSENGERNUM, CRUISENUM, ROOMNUM, FARE)
SHIPS: The Company has 20 ships and the details of these ships are used in every office.
CRUISE: There are around 4000 cruise records; cruise records are used most heavily in the cities from which the cruise described in the record began.
PORT: There are 42 records in port table. The records describe Atlantic Ocean ports are used most heavily in New York and Miami. The records that describe Caribbean Sea ports are used most heavily in Huston and Miami. The records that described Pacific Ocean ports are used most heavily in Los Angeles.
VISIT: Consists of 15,000 records and is primarily used in the New York (headquarters) and Los Angeles.
PASSENGER: Consists of 230,000 records and is primarily used in the New York (headquarters) and Los Angeles offices.
VOYAGE: Consists of 720,000 records and is used in all four offices.
- Compare the various data distribution methods/approaches over Distributed Databases. Critically select an appropriate data distribution solution for Happy Cruise Lines with justifiable reasons.
40% (Approx. 1500 words)
- Design a data distribution scheme in-line with your solution
20% (Design Diagram)
Task 2: Data Warehouse (40%)
Millennium College offers a number of courses over different subject (Major) area. Courses are managed by professors who are employed in different departments. The college records the course details, student details and their results. The college decides to implement data warehouse. One of the requirements is to record grades for courses completed by students. Initial analysis identified the following dimensions.
CourseSection: CourseID, CourseName, Units, SectionNumber, RoomID, RoomCapacity. (During a given Semester the college offers an average of 500 course sections).
Professor: ProfID, ProfName, Title, DepartmentID, DepartmentName. (There are around 200 professors)
Student: StudentID, StudentName, Major. (Each course section has an average 40 students, each student may take up to 5 course section per period).
Period: SemesterID, Year (College follow a trimester system, and it has records for the last 10 years.)
- Discuss the concepts associated with dimensional modelling, its association with Data Warehouse design and explain Star Schema and Snow Flake Schema.
20% (Approx. 750 words)
- Design a Star Schema for modelling the data warehouse for Millennium College. Ensure every dimension have an appropriate Primary Key. You must identify the FACT table with appropriate details (facts) and keys for the dimensions. Assign an appropriate name for the FACT table.
- Using the principles of normalisation convert the Star Schema to a Snow flake Schema.
20% (b& c - Schema Diagrams)
Students will get assistance to complete the tasks through the tutorial sessions. Drafts will be reviewed and formative feedback will be given in the tutorial sessions. So you are much less likely to obtain a good grade if you don’t attend the tutorial sessions.
Support your work with relevant references from books, journals and other quality information resources using the IEEE Referencing system.
Failure to complete any of the above tasks or sub tasks in time will result in a loss of marks.
The assignment should be word-processed. This specification document should be filed at the front of the assignment, with the front sheet (with your Name, Student Number, Date and Signature) visible at the front.
Work must be word processed and submitted as a single file via Turnitin link provided through VLE (Moodle) by 10th August 2017.
The Glyndwr policy on assignment submission will be rigidly adhered to (see your Student Handbook).
- Critically assess some of the more advanced developments in database technology, e.g. Distributed databases, Multidimensional Database, Multimedia Databases, Mobile Databases
- Evaluate the current issues associated with theory to practical implementations in database research.
- Explore advanced aspects of data warehousing, distributed data, data intensive computing, remote access and personalised data; encompassing the principles, research results and commercial application of the technologies.
Key skills for employability:
- Written, oral and media communication skills
- Leadership, team working and networking skills
- Opportunity, creativity and problem solving skills
- Information technology skills and digital literacy
- Information management skills
- Research skills
- Intercultural and sustainability skills
- Career management skills
- Learning to learn (managing personal and professional development, self-management)
In order to achieve an A grade, the work must be excellent in almost all respects, only very minor limitations.
In order to achieve a B+ grade, the work should show strength in most respects, but perhaps has limitations in one or two areas. A good piece of work nevertheless.
In order to achieve a B grade, the work should be of a satisfactory standard, showing strength in some areas, but perhaps let down by poor presentation, poor practical work, or poor written explanations where required.
In order to achieve a C grade, the work should be of a satisfactory standard but may have significant shortcomings in some areas. Nevertheless shows at least a basic understanding of the concepts and a basic practical ability.
A Refer grade will be given to work that is just unsatisfactory