【施工中】MIT 6.828 lab 3: User Environments
JOS的environments基本可以理解成"process"进程的同义词,但是由于"process"是一个unix术语,因此使用environment这个词.
Part A: User Environments and Exception Handling
查看 kern/env.c文件,看到三个全局变量:
1 struct Env *envs = NULL; // All environments
2 struct Env *curenv = NULL; // The current env
3 static struct Env *env_free_list; // Free environment list
envs会在JOS启动后会指向一个Env structures的数组,表示JOS中的全部environments. 理论上,JOS kernel最多能支持NENV个同时运行的environments. 但是实际上不会远不会达到这个数量.
env_free_list是一个链表结构,用来存放当前没有在运行的Env structure.. 和page_free_list 类似.
curenv表示的是当前正在运行的environment,当JOS刚刚启动,第一个environment运行之前,curenv的值为NULL.
接下来我们来阅读一下inc/env.h文件
1
2 /* See COPYRIGHT for copyright information. */
3
4 #ifndef JOS_INC_ENV_H
5 #define JOS_INC_ENV_H
6
7 #include <inc/types.h>
8 #include <inc/trap.h>
9 #include <inc/memlayout.h>
10
11 typedef int32_t envid_t;
12
13 // An environment ID 'envid_t' has three parts:
14 //
15 // +1+---------------21-----------------+--------10--------+
16 // |0| Uniqueifier | Environment |
17 // | | | Index |
18 // +------------------------------------+------------------+
19 // \--- ENVX(eid) --/
20 //
21 // The environment index ENVX(eid) equals the environment's index in the
22 // 'envs[]' array. The uniqueifier distinguishes environments that were
23 // created at different times, but share the same environment index.
24 //
25 // All real environments are greater than 0 (so the sign bit is zero).
26 // envid_ts less than 0 signify errors. The envid_t == 0 is special, and
27 // stands for the current environment.
28
29 #define LOG2NENV 10
30 #define NENV (1 << LOG2NENV)
31 #define ENVX(envid) ((envid) & (NENV - 1))
32
33 // Values of env_status in struct Env
34 enum {
35 ENV_FREE = 0,
36 ENV_DYING,
37 ENV_RUNNABLE,
38 ENV_RUNNING,
39 ENV_NOT_RUNNABLE
40 };
41
42 // Special environment types
43 enum EnvType {
44 ENV_TYPE_USER = 0,
45 };
46
47 struct Env {
48 struct Trapframe env_tf; // Saved registers
49 struct Env *env_link; // Next free Env
50 envid_t env_id; // Unique environment identifier
51 envid_t env_parent_id; // env_id of this env's parent
52 enum EnvType env_type; // Indicates special system environments
53 unsigned env_status; // Status of the environment
54 uint32_t env_runs; // Number of times environment has run
55
56 // Address space
57 pde_t *env_pgdir; // Kernel virtual address of page dir
58 };
59
60 #endif // !JOS_INC_ENV_H
61
* **env_tf**: 用来在切换环境时保存各种register的值,以便之后恢复现场.
* **env_link**: 用于构成一个链表结构,指向喜爱一个空闲的 environment.
* **env_id**: 用于唯一标识使用当前这个Env structure(也就是envs数组中的某个位置)的environment的ID.当这个environment终止时,**envs数组中的用一个位置可能会被re-allocate一个新的environment,但是env_id是不同的.**虽然env_id不同,但是env_id的最后10bit是用来标识在envs的下标的,如果使用的是envs数组中的同一个位置,这部分是相同的.具体可以参考inc/env.h.
* **env_parent_id**: 创建这个environment的environment 的env_id. 就是父进程id...
* **env_type: **用于区分不同种类的环境.对于大部分环境,类型都是**ENV_TYPE_USER.******
* **env_status**: 用来标识当前这个environment的状态.
* ENV_FREE: 标识一个environment是inactive的,因此在env_free_list上.
* ENV_RUNNABLE: 标识一个environment等待运行在处理器上.
* ENV_RUNNING:标识正在运行
* ENV_NOT_RUNNABLE: 标识一个目前active的环境,但是没有准备好运行,原因可能是正在等待一个其他environment的交互.
* ENV_DYING: 可以类比"僵尸进程"
* **env_pgdir: 当前这个environment的page directory.**
Allocating the Environments Array
Modify `mem_init()` in kern/pmap.c to allocate and map the `envs` array. This array consists of exactly `NENV` instances of the `Env` structure allocated much like how you allocated the `pages` array. Also like the `pages` array, the memory backing `envs`should also be mapped user read-only at `UENVS` (defined in inc/memlayout.h) so user processes can read from this array.You should run your code and make sure
check_kern_pgdir()succeeds.
和mem_init中申请pages的空间是一样的.
envs = (struct Env *)boot_alloc(sizeof(struct Env*)*NENV);
后面有空再写
Creating and Running Environments
由于现在JOS还没有一个文件系统,因此将可执行文件以ELF格式嵌入到kernel中.
练习2:
In the file env.c, finish coding the following functions:`env_init()`Initialize all of the `Env` structures in the `envs` array and add them to the `env_free_list`. Also calls `env_init_percpu`, which configures the segmentation hardware with separate segments for privilege level 0 (kernel) and privilege level 3 (user).`env_setup_vm()`Allocate a page directory for a new environment and initialize the kernel portion of the new environment's address space.`region_alloc()`Allocates and maps physical memory for an environment`load_icode()`You will need to parse an ELF binary image, much like the boot loader already does, and load its contents into the user address space of a new environment.`env_create()`Allocate an environment with `env_alloc` and call `load_icode` to load an ELF binary into it.`env_run()`Start a given environment running in user mode.As you write these functions, you might find the new cprintf verb
%euseful – it prints a description corresponding to an error code. For example,r = -E_NO_MEM; panic("env_alloc: %e", r);will panic with the message “env_alloc: out of memory”.
env_init()是这些函数中最简单的一个,只需要注意为了保证env_free_list的顺序和envs array的相同,需要倒序遍历.
1// Mark all environments in 'envs' as free, set their env_ids to 0,
2// and insert them into the env_free_list.
3// Make sure the environments are in the free list in the same order
4// they are in the envs array (i.e., so that the first call to
5// env_alloc() returns envs[0]).
6//
7void
8env_init(void)
9{
10 // Set up envs array
11 // LAB 3: Your code here.
12 cprintf("env_init start\n");
13 for ( int i = NENV-1 ; i >= 0 ; i--)
14 {
15 envs[i].env_id = 0;
16 envs[i].env_status = ENV_FREE;
17 envs[i].env_link = env_free_list;
18 env_free_list = &envs[i];
19 }
1 // Per-CPU part of the initialization
2 cprintf("env_init before init_precpu\n");
3 env_init_percpu();
4 cprintf("env_init end\n");
5}
接下来看env_setup_vm函数,作用是为新的environment初始化一个page directory. 每个environment都有一个自己的page directory.
注释里提示用kern_pgdir作为一个template… 说实话完全没get到含义…查阅资料发现其实就是…copy过来的意思…orz 注意头文件string.h中可能有些有用的函数,比如memcpy
1// Initialize the kernel virtual memory layout for environment e.
2// Allocate a page directory, set e->env_pgdir accordingly,
3// and initialize the kernel portion of the new environment's address space.
4// Do NOT (yet) map anything into the user portion
5// of the environment's virtual address space.
6//
7// Returns 0 on success, < 0 on error. Errors include:
8// -E_NO_MEM if page directory or table could not be allocated.
9//
10static int
11env_setup_vm(struct Env *e)
12{
1 cprintf("env_setup_vm start\n");
2 int i;
3 struct PageInfo *p = NULL;
1 // Allocate a page for the page directory
2 if (!(p = page_alloc(ALLOC_ZERO)))
3 return -E_NO_MEM;
1 // Now, set e->env_pgdir and initialize the page directory.
2 //
3 // Hint:
4 // - The VA space of all envs is identical above UTOP
5 // (except at UVPT, which we've set below).
6 // See inc/memlayout.h for permissions and layout.
7 // Can you use kern_pgdir as a template? Hint: Yes.
8 // (Make sure you got the permissions right in Lab 2.)
9 // - The initial VA below UTOP is empty.
10 // - You do not need to make any more calls to page_alloc.
11 // - Note: In general, pp_ref is not maintained for
12 // physical pages mapped only above UTOP, but env_pgdir
13 // is an exception -- you need to increment env_pgdir's
14 // pp_ref for env_free to work correctly.
15 // - The functions in kern/pmap.h are handy.
1 // LAB 3: Your code here.
2 // 去看一下string.h,里面有些函数可能有用
3 p->pp_ref++;
4 e->env_pgdir = (pde_t *)KADDR(page2pa(p));
5 memcpy(e->env_pgdir,kern_pgdir,PGSIZE);
6 // 不知道有memcpy函数可以用...orz
1 cprintf("PA e->env_pgdir:x\n",PADDR(e->env_pgdir));
2 // UVPT maps the env's own page table read-only.
3 // Permissions: kernel R, user R
4 e->env_pgdir[PDX(UVPT)] = PADDR(e->env_pgdir) | PTE_P | PTE_U;
5 cprintf("env_setup_vim end\n");
return 0;
}
接下来是函数region_alloc,作用是为一个函数分配并映射物理内存.使用page_insert即可
1// Allocate len bytes of physical memory for environment env,
2// and map it at virtual address va in the environment's address space.
3// Does not zero or otherwise initialize the mapped pages in any way.
4// Pages should be writable by user and kernel.
5// Panic if any allocation attempt fails.
6//
7static void
8region_alloc(struct Env *e, void *va, size_t len)
9{
10 cprintf("region_alloc start\n");
11 // LAB 3: Your code here.
12 // (But only if you need it for load_icode.)
13 // use page_insert
14 uintptr_t VA = ROUNDDOWN((uintptr_t)va,PGSIZE);
15 uintptr_t VA_end = ROUNDUP((uintptr_t)(va + len),PGSIZE);
16 struct PageInfo *pginfo = NULL;
17 for (; VA < VA_end ; VA+=PGSIZE)
18 {
19 if (!(pginfo=page_alloc(ALLOC_ZERO)))
20 {
21 panic("page_alloc in region_alloc failed!");
22 }
23 int ret = page_insert(e->env_pgdir, pginfo, (void *)VA, PTE_U | PTE_W | PTE_P);
24 if (ret)
25 {
26 panic("page insert failed %e",ret);
27 }
28 }
29 cprintf("rgion_alloc end\n");
1 // Hint: It is easier to use region_alloc if the caller can pass
2 // 'va' and 'len' values that are not page-aligned.
3 //
4 // You should round va down, and round (va + len) up.
5 // (Watch out for corner-cases!)
6}
接下里要实现的函数是load_icode(),作用是将ELF格式的二进制文件加载到新环境的用户地址空间.
这个函数是相对比较难实现的一个,可以参考boot/main.c中boot loader加载 kernel image的过程.区别在于,boot loader是从disk中加载kernel image,而load_icode()要加载的二进制文件已经在memory中了.
实现参考lab3 抢占式调度
主要的难点在于在load program segment的时候,是load到user的environment,因此需要在load之前使用lcr3指令切换到当前environment的page dir.
以及在指定完program的entry point之后,需要将page dir切换回kern_pgdir
还有就是指定入口点的方法,是将该环境的指令寄存器eip的值设置为该elf格式文件的e_entry的值
1// Set up the initial program binary, stack, and processor flags
2// for a user process.
3// This function is ONLY called during kernel initialization,
4// before running the first user-mode environment.
5//
6// This function loads all loadable segments from the ELF binary image
7// into the environment's user memory, starting at the appropriate
8// virtual addresses indicated in the ELF program header.
9// At the same time it clears to zero any portions of these segments
10// that are marked in the program header as being mapped
11// but not actually present in the ELF file - i.e., the program's bss section.
12//
13// All this is very similar to what our boot loader does, except the boot
14// loader also needs to read the code from disk. Take a look at
15// boot/main.c to get ideas.
16//
17// Finally, this function maps one page for the program's initial stack.
18//
19// load_icode panics if it encounters problems.
20// - How might load_icode fail? What might be wrong with the given input?
21//
22static void
23load_icode(struct Env *e, uint8_t *binary)
24{
25 cprintf("load icode start\n");
26 // Hints:
27 // Load each program segment into virtual memory
28 // at the address specified in the ELF segment header.
29 // You should only load segments with ph->p_type == ELF_PROG_LOAD.
30 // Each segment's virtual address can be found in ph->p_va
31 // and its size in memory can be found in ph->p_memsz.
32 // The ph->p_filesz bytes from the ELF binary, starting at
33 // 'binary + ph->p_offset', should be copied to virtual address
34 // ph->p_va. Any remaining memory bytes should be cleared to zero.
35 // (The ELF header should have ph->p_filesz <= ph->p_memsz.)
36 // Use functions from the previous lab to allocate and map pages.
37 //
38 // question: how to copy memory? for-loop and assgin?
39 // // no! use memcpy in string.h
40 //
41 // All page protection bits should be user read/write for now.
42 // ELF segments are not necessarily page-aligned, but you can
43 // assume for this function that no two segments will touch
44 // the same virtual page.
45 //
46 // You may find a function like region_alloc useful.
47 // Q: how to use region_alloc?
48 //
49 // Loading the segments is much simpler if you can move data
50 // directly into the virtual addresses stored in the ELF binary.
51 // So which page directory should be in force during
52 // this function?
53 //
54 // You must also do something with the program's entry point,
55 // to make sure that the environment starts executing there.
56 // What? (See env_run() and env_pop_tf() below.)
1 // LAB 3: Your code here.
2 struct Elf * elf = (struct Elf *)binary;
3 cprintf("elf->e_magic :x\n",elf->e_magic);
4 if (elf->e_magic != ELF_MAGIC)
5 {
6 panic("invalid ELF file!");
7 }
8 struct Proghdr *ph,*eph;
9 ph = (struct Proghdr *)((uint8_t *)elf + elf->e_phoff);
10 eph = ph + elf->e_phnum;
11 cprintf("ph: x eph:x\n",ph,eph);
12 cprintf("binary:x p_offset:x filesz:x\n",binary,ph->p_offset,ph->p_filesz);
13 lcr3(PADDR(e->env_pgdir));
1 for (; ph < eph ; ph++)
2 {
3 //cprintf("ph->ptype: %d\n",ph->p_type);
4 // ph->ptype should be 1 instead of 0 .
5 if (ph->p_type != ELF_PROG_LOAD) continue;
6 uint8_t * src = binary + ph->p_offset;
7 uint8_t * dst = (uint8_t *)ph->p_va;
8 region_alloc(e,(void *)dst, ph->p_memsz);
9 memcpy(dst,src,ph->p_filesz);
10 // use memcpy .
11 memset(dst + ph->p_filesz, 0, ph->p_memsz - ph->p_filesz);
12 }
13 e->env_tf.tf_eip = elf->e_entry;
14 lcr3(PADDR(kern_pgdir));
15 // lcr3 is too hard for me...
cprintf("load_icode before env_run\n");
//env_run(e);
1 // Now map one page for the program's initial stack
2 // at virtual address USTACKTOP - PGSIZE.
3 region_alloc(e, (void*)(USTACKTOP - PGSIZE), PGSIZE);
4 // LAB 3: Your code here.
5}
接下来是函数env_create(),作用是申请一个新的environment,加载二进制文件,没有什么难度
1// Allocates a new env with env_alloc, loads the named elf
2// binary into it with load_icode, and sets its env_type.
3// This function is ONLY called during kernel initialization,
4// before running the first user-mode environment.
5// The new env's parent ID is set to 0.
6//
7void
8env_create(uint8_t *binary, enum EnvType type)
9{
10 cprintf("env_create start\n");
11 // LAB 3: Your code here.
12 struct Env *env;
13 int ret = env_alloc( &env, 0);
14 if (ret)
15 {
16 panic("env_alloc:%e", ret);
17 }
cprintf("env_create before load_icode\n");
1 env->env_type = type;
2 load_icode(env, binary);
3 env_run(env);
}
之后是env_run,也没什么难度.
1//
2// Context switch from curenv to env e.
3// Note: if this is the first call to env_run, curenv is NULL.
4//
5// This function does not return.
6//
7void env_run(struct Env *e)
8{
9 cprintf("env_run start\n");
10 // Step 1: If this is a context switch (a new environment is running):
11 // 1. Set the current environment (if any) back to
12 // ENV_RUNNABLE if it is ENV_RUNNING (think about
13 // what other states it can be in),
14 // 2. Set 'curenv' to the new environment,
15 // 3. Set its status to ENV_RUNNING,
16 // 4. Update its 'env_runs' counter,
17 // 5. Use lcr3() to switch to its address space.
18 // Step 2: Use env_pop_tf() to restore the environment's
19 // registers and drop into user mode in the
20 // environment.
1 // Hint: This function loads the new environment's state from
2 // e->env_tf. Go back through the code you wrote above
3 // and make sure you have set the relevant parts of
4 // e->env_tf to sensible values.
1 // LAB 3: Your code here.
2 if (curenv && curenv->env_status == ENV_RUNNING)
3 {
4 curenv->env_status = ENV_RUNNABLE;
5 }
6 cprintf("miao 1 in env_run\n");
7 curenv = e;
8 curenv->env_status = ENV_RUNNING;
9 curenv->env_runs++;
10 cprintf(" miao 2 in env_run\n");
11 // env_pgdir[0] == 0, which is wrong.
12 cprintf("env_run curenv->env_pgdir :x\n", curenv->env_pgdir[PDX(UVPT)]);
13 lcr3(PADDR(curenv->env_pgdir));
14 cprintf("miao 3 in env_run\n");
env_pop_tf(&curenv->env_tf);
cprintf("miao 4 in env_run\n");
panic("env_run not yet implemented");
}
代码实现完了,但是发现直接crash了..显示triple fault
不要慌,问题不大.这是因为JOS目前还没有允许user space转化到kernel的机制,因此抛出了"general protection exception" . 然后JOS也处理不了抛出的"general protection exception",因此再次抛出异常. 最后以抛出triple fault结束.
但是我们肯定需要一个办法来验证我们的实现是对的.
方式是使用gdb,在kern/env.c的env_pop_tf函数上设置断点.该函数是进入user mode之前运行的最好一个函数.然后单步发现运行的恰好是lib/entry.S中start label后的几个
#include <inc/mmu.h>
#include <inc/memlayout.h>
.data
// Define the global symbols 'envs', 'pages', 'uvpt', and 'uvpd'
// so that they can be used in C as if they were ordinary global arrays.
.globl envs
.set envs, UENVS
.globl pages
.set pages, UPAGES
.globl uvpt
.set uvpt, UVPT
.globl uvpd
.set uvpd, (UVPT+(UVPT>>12)*4)
1// Entrypoint - this is where the kernel (or our parent environment)
2// starts us running when we are initially loaded into a new environment.
3.text
4.globl _start
5_start:
6 // See if we were started with arguments on the stack
7 cmpl $USTACKTOP, %esp
8 jne args_exist
1 // If not, push dummy argc/argv arguments.
2 // This happens when we are loaded by the kernel,
3 // because the kernel does not know about passing arguments.
4 pushl $0
5 pushl $0
1args_exist:
2 call libmain
31: jmp 1b
然后查看obj/user/hello.asm 文件,找到sys_cputs()函数中的 int $0x30指令(在第1934行左右),该指令对应的user-space address是0x00800add
在该地址处设置断点,如果可以顺利执行到 0x800add: int $0x30 ,说明之前的实现没有问题.