rust-raspberrypi-OS-tutorials/01_bareminimum/README.md

118 lines
4.8 KiB
Markdown
Raw Normal View History

2018-04-04 18:40:41 +00:00
# Tutorial 01 - Bare Minimum
Okay, we're not going to do much here, just test our toolchain. The resulting
kernel8.img should boot on the Raspberry Pi 3, and stop all CPU cores in an
infinite waiting loop. You can check that by running
```bash
$ make qemu
... some output removed for clearity: ...
----------------
IN:
0x00080000: d503205f wfe
0x00080004: 17ffffff b #0x80000
```
## Crate setup
In this tutorial, we are compiling a kernel that is in the end only executing a
single assembly instruction which we program with an assembly file.
However, since we want to use the toolchain that is delivered with `rustup` as
much as possible, we are already setting up a Rust crate. This allows us to use
`rustc` and LLVM's `lld.ld` linker to process our assembly file.
## Target
The Raspberry Pi 3 features a processor that uses ARM's `AArch64` architecture.
Conveniently, Rust already provides a generic target for bare-metal aarch64 code
that we can leverage. It is called [aarch64-unknown-none].
[aarch64-unknown-none]: https://github.com/andre-richter/rust/blob/master/src/librustc_target/spec/aarch64_unknown_none.rs
In the `Makefile`, we select this target in various places by passing it to
cargo using the `--target` cmdline argument.
Additionally, we provide a config file in `.cargo/config` were we make further
specializations:
```toml
[target.aarch64-unknown-none]
rustflags = [
"-C", "link-arg=-Tlink.ld",
"-C", "target-feature=-fp-armv8",
"-C", "target-cpu=cortex-a53",
]
```
The first line tells rustc to use our custom `link.ld` linker script. The second
line instructs the compiler to not use floating point operations. This is a
common choice when writing an operating system kernel. If floating point is not
explicitly disabled, it is possible that the compiler uses auto-vectorization to
optimize, for example, operations on array data structures. This would
implicitly result in use of floating point registers and operations. However,
since it is very costly to save and restore floating point registers during
context-switches, use of fp is usually disabled from the get go to save the
cycles and optimize the kernel for fast context switching.
Finally, the third arguments specifies the exact CPU type that is used in
Raspberry Pi 3, so that the compiler can optimize for it.
Since the `aarch64-unknown-none` target is not shipped with an associated
precompiled standard library, and since we anyways modify the target via the
`.cargo/config` file, we are using [cargo-xbuild][xbuild] to compile our own
standard library. This way, we can ensure that our bare-metal code is optimized
throughout.
[xbuild]: https://github.com/rust-osdev/cargo-xbuild
## Linker script `link.ld`
We just set the base address where our kernel8.img will be loaded, and we put
the only section we have there, which is `.text.boot`. Important note, for
AArch64 the load address is **0x80_000**, and not **0x80_00** as with AArch32.
## Makefile
Our Makefile has a few useful targets:
- `kernel8` compiles the crate either in release or debug mode. For the latter,
add `DEBUG=1` before invoking make, e.g. `DEBUG=1 make`
- `kernel8.img` uses `cargo objcopy` to generate our kernel binary. Citing the [binutils documentation][butils]:
- "_When objcopy generates a raw binary file, it will essentially produce a
memory dump of the contents of the input object file. All symbols and
relocation information will be discarded. The memory dump will start at
the load address of the lowest section copied into the output file._"
- `qemu` loads our kernel into an emulated RPi3, and shows as output the
assembler blocks that are executed. This happens in a docker container.
[butils]: https://sourceware.org/binutils/docs/binutils/objcopy.html
2018-04-04 18:40:41 +00:00
## main.rs
We define the crate to not use the standard library (`#![no_std]`), indicate
that it does not have a main function via `#![no_main]`, and also define a stub
for the `panic_fmt()` handler, which is a requirement for `no_std` crates. We do
this by pulling in the [panic-abort][pa] crate.
2018-04-04 18:40:41 +00:00
[pa]: https://crates.io/crates/panic-abort
In summary, we (mis)use `main.rs` as a wrapper to process our assembly file via
2018-04-04 18:40:41 +00:00
`rustc`. The assembly file iself is included with the [global_asm!()][gasm]
macro.
[gasm]: https://doc.rust-lang.org/unstable-book/language-features/global-asm.html
## boot_cores.S
When the control is passed to kernel8.img, the environment is not ready yet for
Rust. Therefore we must implement a small preamble in assembly, no Rust for now.
All we do is executing [wfe][wfe], an instruction that puts the CPU cores to
sleep until an asynchronous event occurs. If that happens, we jump right back to
`wfe` again.
[wfe]: http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.ddi0360e/CHDBGCFH.html
Note that the CPU has 4 cores. All of them will execute the same infinite loop
for now.