rust-raspberrypi-OS-tutorials/04_mailboxes
2018-04-09 22:18:27 +02:00
..
raspi3_glue Add tutorial 04_mailboxes 2018-04-02 19:54:51 +02:00
src compiler_builtins are now mandatory; Add them. 2018-04-08 13:28:57 +02:00
aarch64-raspi3-none-elf.json Add abi blacklist for target 2018-04-09 22:11:43 +02:00
Cargo.lock compiler_builtins are now mandatory; Add them. 2018-04-08 13:28:57 +02:00
Cargo.toml compiler_builtins are now mandatory; Add them. 2018-04-08 13:28:57 +02:00
kernel8.img compiler_builtins are now mandatory; Add them. 2018-04-08 13:28:57 +02:00
link.ld Add tutorial 04_mailboxes 2018-04-02 19:54:51 +02:00
Makefile Fix clippy targets 2018-04-09 22:18:27 +02:00
README.md Xargo 0.3.12 supports builtins now. Remove Xargo.toml again 2018-04-08 19:56:56 +02:00

Tutorial 04 - Mailboxes

Before we could go on with UART0, we need mailboxes. So in this tutorial we introduce the mailbox interface. We'll use it to query the board's serial number and print that out on UART1.

NOTE: qemu does not redirect UART1 to terminal by default, only UART0!

uart.rs

MiniUart::hex(&self, d: u32) prints out a binary value in hexadecimal format.

mbox.rs

The mailbox interface. First we fill up the message in the mbox.buffer array, then we call Mbox::call(&mut self, channel: u32) to pass it to the GPU, specifying the mailbox channel. In this example we have used the property channel, which requires the message to be formatted as:

 0. size of the message in bytes, (x+1)*4
 1. mbox::REQUEST magic value, indicates request message
 2-x. tags
 x+1. mbox::tag::LAST magic value, indicates no more tags

Where each tag looks like:

 n+0. tag identifier
 n+1. value buffer size in bytes
 n+2. must be zero
 n+3. optional value buffer

Synchronization

When signaling the GPU about a new mailbox message, we need to take care that mailbox buffer setup has really finished. Both setting up mailbox contents and signaling the GPU is done with store operations to independent memory locations (RAM and MMIO). Since compilers are free to reorder instructions without control-flow or data-dependencies for optimization purposes, we need to take care that signaling the GPU really takes place after all of the contents have been written to the mailbox buffer.

One way to do this would be to define the whole mailbox buffer as volatile, as well as the location that we write to to signal the GPU. The compiler is not allowed to reorder memory operations tagged with the volatile keyword with each other. But this is not needed here. We don't care if the compiler optimizes the buffer setup code as long as signaling the GPU takes place afterwards.

Therefore, we prevent premature signaling by inserting an explicit compiler fence after the buffer preparation code. Since we signal the CPU by calling another function, the fence would only be effective if that function was a) inlined and b) the inlined instructions then reordered with buffer setup code. Otherwise the compiler has to assume that the called function has dependencies on previous memory operations and not reorder here. Although there is little chance that the reordering scenario happens, I'll leave the fence there nonetheless for academic purposes :-)

Please note that such reordering might also be done by CPUs that feature out-of-order execution. Lucky us, although the Rasperry Pi 3 features ARMv8.0-A CPU cores, the Cortex-A53 variant is used, which does not support this feature. Otherwise, a fence that additionally emits corresponding CPU instructions to prevent this behavior would be needed.

main.rs

We query the board's serial number and then we display it on the serial console.