[ General Information
| Other Pointers
[ Announcements ] [ Wish List ]
Course description: This course is for students interested in methods and tools for querying the Web as well as basic concepts and underlying principles. Topics include: (1) describing, transforming, and querying semi-structured data (XML, XSLT, XQUERY, SQL/XML); (2) understanding and navigating in Web pages (HTML, Web search engines, targeted Web query tools); and (3) capturing and manipulating semantic data (RDF, ontologies, semantic Web services). Students will do a course project; there will also be weekly or bi-weekly homeworks centered around the project and a midterm exam. | Prerequisites: CSE305. | Credits: 3.
Instructor: Annie Liu | Email: liuATcsDOTsunysbDOTedu | Office: Computer Science 1433 | Phone: 632-8463.
TA: Chen Zhao | Email: chzhaoATcsDOTsunysbDOTedu.
Hours: Tue Thu 3:50-5:10PM, in Computer Science 2129/31. | Annie's office hours: Tue 2:00-3:20PM, Thu 10-11:20AM, in CS 1433. | Chen's office hours: Wed 4:00-5:20PM, Fri 12:50-2:10PM, in CS 2110.
Textbook: The first part of the course will use XML and Web Data, Chapter 18 of Database and Transaction Processing: An Application-Oriented Approach in the upcoming new edition, Addison Wesley, by Michael Kifer, Arthur Bernstein, and Philip Lewis. The department will provide copies of the chapter to students in the course, with a small fee. The rest of the course will use introductory and tutorial materials provided throughout the course. You are asked to take good course notes yourself. Slides will be used for parts of the course and will be made available.
Grading: The midterm exam is worth 40% of the grade, the course project 30%, and homeworks together 30%. A bi-weekly homework is worth twice as much as a weekly homework, and all homeworks are also part of the course project. There will also be bonus points with the project and exam as they fit. No late handins will receive credit. Exceptions only when supported with official documents will be accommodated. The Pass/No Credit (P/NC) option is not available for this course.
Course homepage: http://www.cs.sunysb.edu/~liu/cse391/, containing all course related information.
Handout Q: Questionnaire
Handout Q2: Questionnaire 2
Slides Set 1: XML Part I
Slides Set 2: XML Part II
Slides with Texts: Semantic Web Tutorial Using N3 (all and only explanation part)
| primer (slides, all slides) + formats (slides, all slides)
| shortcuts (slides, all slides) + ontologies (slides, all slides)
| rules + processing (slides, all slides)
| application in travel tools (slides, all slides)
Handout A1: Assignment 1: A Web Query Tool
Handout A2: Assignment 2: XML Data Description
Handout A3: Assignment 3: XML Queries
Handout A4: Assignment 4: Project Description
Handout A5: Assignment 5: Project Design and Prototyping
Handout A6: Assignment 6: Project Implementation and Preliminary Experiments
Handout S2: Sample XML Schema Solution to Assignment 2
Handout S3: Sample XQuery Solution to Assignment 3
Handout D: Sample Data for Assignment 3 Demo
Handout MP: Midterm Practice Problems
Handout M: Midterm Exam
Handout MS: Solution to Midterm Exam
Handout P: Project Report, Presentation, and Demo
XML Languages and Technologies
XQuery | W3C XML Query (XQuery) Project
XQuery 1.0 and XPath 2.0 Functions and Operators
An XQuery implementation freeware
| Downloading QuiP
eXist: An open source XML database
The XML Revolution, Technologies for the future web: An overview of XML technologies | Contents
Other Web Programming Languages and Technologies
| W3C MarkUp Validation Service
PHP: Hypertext Preprocessor
A Tutorial on Java Servlets and Java Server Pages (JSP)
| An overview of Servlets and JSP
Dynamic Page Languages: An overview pre Servlets and JSP
Interactive Web Services with Java: An overview, including their own work on JWIG | Contents
Web Survey and Internet Research Reports by SecuritySpace | Web Server Markete Share | Apache Modules
Semantics Web Languages and Technologies
RDF | Developer tools
OWL Web Ontology Language Overview
Some General Tools (Independant of Application Domains)
HTTrack: A website copier
ixquick: A meta searcher
Xstrudel: An HTML generator | Tutorial
Some Specialized Tools (Specific to Application Domains)
Seeing negative feedbacks of ebay users (from Charles and Bin, 1/31/03)
Searching book prices (from Charles and Bin, 1/31/03)
You should learn all information on the course homepage. Check the homepage periodically for Announcements.
Do all course work. The homeworks and projects are integral parts of the course as they provide concrete experiences with the basic ideas covered in the class.All work must be done individually unless permitted explicitly. You are encouraged to discuss with others and look up references, but you should write up your own solutions independently and credit all sources that you used. Any plagiarism or other forms of cheating will result in an F or worse.
Computing facilities: You will be given an account in the Transaction Processing Lab. Never let anyone else use your account; it is against the rules. Please be conscious of security in the lab; theft or vandalism will be punished severely. If you have any problems with the hardware or software in the lab (other than with the requirements of the course work itself), please email ntadminATcsDOTsunysbDOTedu with a copy to me; neither the TA nor I could fix such problems.
Disability: If you have a physical, psychological, medical or learning disability that may have an impact on your ability to carry out assigned course work, please contact the staff in the Disabled Student Services office (DSS), Room 133 Humanities, 632-6748/TDD. DSS will review your concerns and determine with you what accommodations are necessary and appropriate. All information and documentation of disability are confidential.