In this project you will become familiar with the xv6 virtual memory system and work to add a few features that are common in modern OSes. The project is composed of two parts, which you must complete in order. This project can be completed in pairs or individually.

In part 1, you will change the virtual memory space of user processes so that the address 0x0 is invalid. Currently, 0x0 (NULL) is a valid address for an xv6 user process. You will change xv6 so that dereferencing address 0x0 causes a page fault.

Part 2 (50 points)
In part 2, you will implement a copy-on-write fork in xv6. This is an important performance optimization. Currently, the fork syscall copies all of a process' memory. Most Unix-based OSes use copy-on-write instead, which lets the OS fork without copying any memory up front.

Technically, part 2 can be developed independently of part 1. However, part 2 is more difficult than part 1 and your experience with part 1 will help you with part 2, so we highly recommend completing part 1, then part 2. Do not split parts 1 and 2 between the two partners because this will take longer overall.

Educational Objectives

Part 0: Getting a fresh clone

You should start from a fresh copy of the xv6 code. Join Project 2 on Github Classroom and clone the repository:

After you've done this, you should get an email with a link to a private repository like https://github.com/starzia-teaching/project-1-GROUPNAME

From the github web interface you can click "Clone or download" and then "Use HTTPS" to get your group's repository URL (used below). You can clone the repo using the following command:

Notice that I added my github username to the user above, before "@github.com". If you get an error related to "gnome-ssh-askpass" then try running "unsetenv SSH_ASKPASS" or "unset SSH_ASKPASS".

Part 1: Making the null pointer invalid

In your prior C programming experience, you probably became somewhat familiar with the null pointer. In particular, you know that dereferencing a pointer whose value is 0x0 (NULL) causes a segmentation fault. The null pointer serves as a convenient sentinel value because, in most operating systems, address 0x0 is reserved for system use, which means that user programs should not be accessing that address anyway.

However, in xv6, address 0x0 is a valid virtual address. A user process’s address space goes from 0x0 to 0x7FFFFFFF (KERNBASE-1). This means that you can write a C program that dereferences a null pointer and run it on xv6 without causing a segmentation fault. Give it a try! You will be able to use this program later to test your work.

Your Task

Your first task is to modify how xv6 manages the user virtual address space so that 0x0 is no longer a valid address. In particular, if a user program tries to dereference a null pointer, xv6 should trap and kill the process. Luckily for you, xv6 does this automatically for invalid memory accesses.

Your modifications must not prevent xv6 from functioning normally. In other words, don’t break anything.

Guidance and hints

Part 2: Copy-on-write fork

The fork syscall creates a duplicate of the current process. Making a full copy of the process' memory will be slow if the process is large. It's also wasteful if the fork is immediately followed by an exec, which is often what happens. Recall that exec clears the process' memory and replaces it with the code loaded from an executable file (all that freshly copied memory is thrown out!).

Copy-on-write is a strategy that avoids this performance problem. Under this strategy, the copy is lazy. In other words, we delay copying until it's absolutely necessary. The kernel can do this by cleverly managing the page tables for both parent and child processes. Specifically, we allow the child process to read from the parent's copy of the shared memory page until either the parent or child writes to the page. At this point, both parent and child need their own copy of the page because they expect to see different values.

Notice that a child can fork again, leading to a page being shared by more than just two processes. We have to keep a shared page reference count to make sure that it's only cleaned up when no process is referring to it.

Your Tasks

Guidance and hints

Submission Instructions

Please add a file "team.txt" to your repository that gives the name, netid, and email address of both partners.

You will submit your solution through github classroom by simply committing your changes and pushing to your private repository. If you created new files for your solution then you will have to tell git to add these files to the project. Run "git status" and look for untracked files. Then run "git add <my_new_file>" to add that file to the project. For example, you'll have to run "git add team.txt". Review your changes with "git diff" and finally run "git commit -a" to commit your changes. You can see your changes relative to the starting point by running "git diff 9e2e4f22b". Now push to your private github repository in github classroom by running "git push".

You should go to the github classroom website to verify that all your new code appears when you view the list of commits. You should also test it by cloning the repository again (to a different folder) and testing that it works.

NOTE: If you submit your code before the deadline and then realize you need to change something, you can just push an update with the fix. You might find it useful to push prior to the deadline when you want to share your progress with your partner.

In Canvas, list:

Names and netids of both partners
Your Github classroom group name (this will allow us to find your submission).
Github usernames for both partners

Resources

Some useful GDB and QEMU commands: https://pdos.csail.mit.edu/6.828/2018/labguide.html
xv6 textbook: https://pdos.csail.mit.edu/6.828/2018/xv6/book-rev11.pdf
xv6 code on github: https://github.com/mit-pdos/xv6-public
OS development wiki: https://wiki.osdev.org/Main_Page
This wiki has some good explanations of the CPU features used by the OS.

Project 2: Virtual Memory on xv6
