mirror of
https://github.com/genodelabs/genode.git
synced 2025-01-10 06:52:56 +00:00
70d0b00dde
r3 contains the recent Nova upstream kernel version plus the Genode specific extensions and changes as known from r2. Additionally, the r3 branch * contains the assign_pci patch now directly, * adds support for cross CPU IPC, * fixes some issues with freeing up kernel memory part of r2 and * update the documentation a bit. Fixes #814
236 lines
11 KiB
Plaintext
236 lines
11 KiB
Plaintext
|
|
==========================================
|
|
How to use Genode with the NOVA hypervisor
|
|
==========================================
|
|
|
|
Norman Feske
|
|
|
|
|
|
When we started the development of Genode in 2006 at the OS Group of the TU
|
|
Dresden, it was originally designated to be the user land of a next-generation
|
|
and to-be-developed new kernel called NOVA. Because the kernel was not ready at
|
|
that time, we had to rely on intermediate solutions as kernel platform such as
|
|
L4/Fiasco and Linux during development. These circumstances led us to the
|
|
extremely portable design that Genode has today and motivated us to make Genode
|
|
available on the whole family of L4 microkernels. In December 2009, the day we
|
|
waited for a long time had come. The first version of NOVA was publicly
|
|
released:
|
|
|
|
:Official website of the NOVA hypervisor:
|
|
[http://hypervisor.org]
|
|
|
|
Besides the novel and modern kernel interface, NOVA has a list of features that
|
|
sets it apart from most other microkernels, in particular support for
|
|
virtualization hardware, multi-processor support, and capability-based
|
|
security.
|
|
|
|
|
|
Why bringing Genode to NOVA?
|
|
############################
|
|
|
|
NOVA is an acronym for NOVA OS Virtualization Architecture. It stands for a
|
|
radically new approach of combining full x86 virtualization with microkernel
|
|
design principles. Because NOVA is a microkernelized hypervisor, the term
|
|
microhypervisor was coined. In its current form, it successfully addresses
|
|
three main challenges. First, how to consolidate a microkernel system-call API
|
|
with a hypercall API in such a way that the API remains orthogonal? The answer
|
|
to this question lies in NOVA's unique IPC interface. Second, how to implement
|
|
a virtual machine monitor outside the hypervisor without spoiling
|
|
performance? The Vancouver virtual machine monitor that runs on top NOVA proves
|
|
that a decomposition at this system level is not only feasible but can yield
|
|
high performance. Third, being a modern microkernel, NOVA set out to pursue a
|
|
capability-based security model, which is a challenge on its own.
|
|
|
|
Up to now, the NOVA developers were most concerned about optimizing and
|
|
evaluating NOVA for the execution of virtual machines, not so much about
|
|
running a fine-grained decomposed multi-server operating system. This is where
|
|
Genode comes into play. With our port of Genode to NOVA, we contribute the
|
|
workload to evaluate NOVA's kernel API against this use case. We are happy to
|
|
report that the results so far are overly positive.
|
|
|
|
At this point, we want to thank the main developers of NOVA Udo Steinberg and
|
|
Bernhard Kauer for making their exceptional work and documentation publicly
|
|
available, and for being so responsive to our questions. We also greatly
|
|
enjoyed the technical discussions we had and look forward to the future
|
|
evolution of NOVA.
|
|
|
|
|
|
How to explore Genode on NOVA?
|
|
##############################
|
|
|
|
To download the NOVA kernel and integrate it with Genode, issue the following
|
|
command from within the 'base-nova' directory:
|
|
|
|
! make prepare
|
|
|
|
For the vesa driver on x86 the x86emu library is required and can be downloaded
|
|
and prepared by invoking the following command from within the 'libports'
|
|
directory:
|
|
|
|
! make prepare PKG=x86emu
|
|
|
|
For creating a preconfigured build directory prepared for compiling Genode for
|
|
NOVA, use the 'create_builddir' tool:
|
|
|
|
! <genode-dir>/tool/create_builddir nova_x86_32 BUILD_DIR=<build-dir>
|
|
|
|
This tool will create a fresh build directory at the location specified
|
|
as 'BUILD_DIR'. Provided that you have installed the
|
|
[http://genode.org/download/tool-chain - Genode tool chain], you can now build
|
|
the NOVA kernel via
|
|
|
|
! make kernel
|
|
|
|
For test driving Genode on NOVA directly from the build directory, you can use
|
|
Genode's run mechanism. For example, the following command builds and executes
|
|
Genode's graphical demo scenario on Qemu:
|
|
|
|
! make run/demo
|
|
|
|
|
|
Challenges
|
|
##########
|
|
|
|
From all currently supported base platforms of Genode, the port to NOVA was
|
|
the most venturesome effort. It is the first platform with kernel support for
|
|
capabilities and local names. That means no process except the kernel has
|
|
global knowledge. This raises a number of questions that seem extremely hard
|
|
to solve at the first sight. For example: There are no global IDs for threads
|
|
and other kernel objects. So how to address the destination for an IPC message?
|
|
Or another example: A thread does not know its own identity per se and there is
|
|
no system call similar to 'getpid' or 'l4_myself', not even a way to get a
|
|
pointer to a thread's own user-level thread-control block (UTCB). The UTCB,
|
|
however, is needed to invoke system calls. So how can a thread obtain its UTCB
|
|
in order to use system calls? The answers to these questions must be provided by
|
|
user-level concepts. Fortunately, Genode was designed for a capability kernel
|
|
right from the beginning so that we already had solutions to most of these
|
|
questions. In the following, we give a brief summary of the specifics of Genode
|
|
on NOVA:
|
|
|
|
* We maintain our own system-call bindings for NOVA ('base-nova/include/nova/')
|
|
derived from the NOVA specification. We put the bindings under MIT license
|
|
to encourage their use outside of Genode.
|
|
|
|
* Core runs directly as roottask on the NOVA hypervisor. On startup, core
|
|
maps the complete I/O port range to itself and implements debug output via
|
|
comport 0.
|
|
|
|
* Because NOVA does not allow rootask to have a BSS segment, we need a slightly
|
|
modified linker script for core (see 'src/platform/roottask.ld').
|
|
All other Genode programs use Genode's generic linker script.
|
|
|
|
* The Genode 'Capability' type consists of a portal selector expressing the
|
|
destination of a capability invocation.
|
|
|
|
* Thread-local data such as the UTCB pointer is provided by the new thread
|
|
context management introduced with the Genode release 10.02. It enables
|
|
each thread to determine its thread-local data using the current stack
|
|
pointer.
|
|
|
|
* NOVA provides threads without time called local execution contexts (EC).
|
|
Local ECs are used as server-side RPC handlers. The processing time
|
|
needed to perform RPC requests is provided by the client during the RPC call.
|
|
This way, RPC semantics becomes very similar to function call semantics with
|
|
regard to the accounting of CPU time. Genode already distinguishes normal
|
|
threads (with CPU time) and server-side RPC handlers ('Server_activation')
|
|
and, therefore, can fully utilize this elegant mechanism without changing the
|
|
Genode API.
|
|
|
|
* On NOVA, there are no IPC send or IPC receive operations. Hence, this part
|
|
of Genode's IPC framework cannot be implemented on NOVA. However, the
|
|
corresponding classes 'Ipc_istream' and 'Ipc_ostream' are never used directly
|
|
but only as building blocks for the actually used 'Ipc_client' and
|
|
'Ipc_server' classes. Compared with the other Genode base platforms, Genode's
|
|
API for synchronous IPC communication maps more directly onto the NOVA
|
|
system-call interface.
|
|
|
|
* The Lock implementation utilizes NOVA's semaphore as a utility to let a
|
|
thread block in the attempt to get a contended lock. In contrast to the
|
|
intuitive way of using one kernel semaphore for each user lock, we use only
|
|
one kernel semaphore per thread and the peer-to-peer wake-up mechanism we
|
|
introduced in the release 9.08. This has two advantages: First, a lock does
|
|
not consume a kernel resource, and second, the full semantics of the Genode
|
|
lock including the 'cancel-blocking' semantics are preserved.
|
|
|
|
* NOVA does not support server-side out-of-order processing of RPC requests.
|
|
This is particularly problematic in three cases: Page-fault handling, signal
|
|
delivery, and the timer service.
|
|
|
|
A page-fault handler can receive a page fault request only if the previous
|
|
page fault has been answered. However, if there is no answer for a
|
|
page-fault, the page-fault handler has to decide whether to reply with a
|
|
dummy answer (in this case, the faulter will immediately raise the same page
|
|
fault again) or block until the page-fault can be resolved. But in the latter
|
|
case, the page-fault handler cannot handle any other page faults. This is
|
|
unfeasible if there is only one page-fault handler in the system. Therefore,
|
|
we instantiate one pager per user thread. This way, we can block and unblock
|
|
individual threads when faulting.
|
|
|
|
Another classical use case for out-of-order RPC processing is signal
|
|
delivery. Each process has a signal-receiver thread that blocks at core's
|
|
signal service using an RPC call. This way, core can selectively deliver
|
|
signals by replying to one of these in-flight RPCs with a zero-timeout
|
|
response (preserving the fire-and-forget signal semantics). On NOVA however,
|
|
a server cannot have multiple RPCs in flight. Hence, we use a NOVA semaphore
|
|
shared between core and the signal-receiver thread to wakeup the
|
|
signal-receiver on the occurrence of a signal. Because a semaphore-up
|
|
operation does not carry payload, the signal has to perform a non-blocking
|
|
RPC call to core to pick up the details about the signal. Thanks to Genode's
|
|
RPC framework, the use of the NOVA semaphore is hidden in NOVA-specific stub
|
|
code for the signal interface and remains completely transparent at API
|
|
level.
|
|
|
|
* Because NOVA provides no time source, we use the x86 PIT as user-level time
|
|
source, similar as on OKL4.
|
|
|
|
* On the current version of NOVA, kernel capabilities are delegated using IPC.
|
|
Genode supports this scheme by being able to marshal 'Capability' objects as
|
|
RPC message payload. In contrast to all other Genode base platforms where
|
|
the 'Capability' object is just plain data, the NOVA version must marshal
|
|
'Capability' objects such that the kernel translates the sender-local name to
|
|
the receiver-local name. This special treatment is achieved by overloading
|
|
the marshalling and unmarshalling operators of Genode's RPC framework. The
|
|
transfer of capabilities is completely transparent at API level and no
|
|
modification of existing RPC stub code was needed.
|
|
|
|
|
|
Manually booting Genode on NOVA
|
|
###############################
|
|
|
|
NOVA supports multi-boot-compliant boot loaders such as GRUB, Pulsar, or gPXE.
|
|
For example, a GRUB configuration entry for booting the Genode demo scenario
|
|
with NOVA looks as follows, whereas 'genode/' is a symbolic link to the
|
|
'var/run/demo/genode' directory created by invoking the 'demo' run script.
|
|
|
|
! title Genode demo scenario
|
|
! kernel /hypervisor iommu serial
|
|
! module /genode/core
|
|
! module /genode/config
|
|
! module /genode/init
|
|
! module /genode/timer
|
|
! module /genode/nitpicker
|
|
! module /genode/liquid_fb
|
|
! module /genode/launchpad
|
|
! module /genode/scout
|
|
! module /genode/testnit
|
|
! module /genode/nitlog
|
|
! module /genode/pci_drv
|
|
! module /genode/ps2_drv
|
|
! module /genode/fb_drv
|
|
|
|
Limitations
|
|
###########
|
|
|
|
The current NOVA version of Genode is able to run the complete Genode demo
|
|
scenario including several device drivers (PIT, PS/2, VESA, PCI) and the GUI.
|
|
Still the NOVA support is not on par with some of the other platforms.
|
|
The current limitations are:
|
|
|
|
* Threads (ECs) can not be migrated to another CPU once started.
|
|
|
|
* For portals used as exception vectors for threads, the thread causing the
|
|
exception and the handler thread which is bound to the exception portal must
|
|
be on the same CPU.
|
|
|
|
* Priorities for Genode threads are not supported.
|