

mikro -Multiprocessor Init in Kernel

ulien Frecl

Introduction

Interruptions

CPU init

Percpu variables

Conclusion

# mikro - Multiprocessor Init in Kernel

#### Julien Freche

julien.freche@lse.epita.fr
http://lse.epita.fr/

### Outline I



- Introduction
- 2 Interruptions
  - Old PIC
  - IOAPIC
  - LAPIC
  - IPI
- 3 CPU init
  - BootStrap Processor
  - Application Processor
  - INIT-SIPI-SIPI
- 4 Percpu variables
  - Usage
  - Implementation
  - Using clang
- Conclusion

mikro -Multiprocessor Init in Kernel

Julien Frech

Introduction

Interruptions

CPU init

Percpu variables

Conclusion

### Introduction



mikro -Multiprocessor Init in Kernel

ulien Frech

Introduction

Interruptions

CPU init

Percpu variables

Conclusion

# Introduction

# Too many questions



mikro -Multiprocessor Init in Kernel

ulien Frech

Introduction

Interruptions

CPU init

Percpu variables

Conclusion

I will talk about x86 system with multiple processors.

- How to handle interruptions?
- How to boot all CPUs ?
- What is the state of the system at boot?
- How to detect the number of CPU(s)?

Dealing with multiple processors requires to deal with interruptions. Let's see why.

# Interruptions



mikro -Multiprocessor Init in Kernel

Introduction

CPU init

Conclusion

# **Interruptions**

### I interrupt you



mikro -Multiprocessor Init in Kernel

Julien Fred

Introduction

#### Interruptions

IOAPI LAPIC

LAPIC IPI

CPU init

Percpu variables

Conclusion

### Interruption

Signal sent to a processor to report an event that requires immediate attention.

Interrupts can be caused by:

- Hardware: event caused by some device
- Software: system call from userland, debugging purposes, ..

### Old PIC



Multiprocessor Init

in Kernel

**Old PIC** 

CPU init

# Vintage chip



mikro -Multiprocessor Init in Kernel

ulien Frec

Introduction

Interruptions

Old PIC

LAPI

CPU init

Percpu variables

Conclusion

PIC

It controls the CPU's interrupt mechanism, by accepting several interrupt requests and feeding them to the processor in order.

#### Limitations:

- Only 8 pin per PIC (16 IRQ on x86 with two PICs).
- No SMP support, can only send interrupts to one CPU.
- Programmed with IO ports

The PIC is no replaced by the IOAPIC on modern systems.

### **IOAPIC**



mikro -Multiprocessor Init in Kernel

Julien Frec

Introduction

Interruptions
Old PIC

LAPIC

CPU init

Percpu variables

Conclusion

### **IOAPIC**

# Incoming chip



mikro -Multiprocessor Init in Kernel

ulien Fred

Introduction

Interruptions
Old PIC

LAF

CPII init

Percpu variables

Conclusion

#### 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D > 4 D >

IOAPIC

It collects interrupts from all devices and provide a way to send it to a CPU or a group of CPU.

#### Features:

- 24 pin per IOAPIC
- Memory mapped device
- You can have multiple IOAPICs
- Special bus to send interruptions: ICC bus
- Handle global interruptions (not specific to a CPU)

### **LAPIC**



mikro tiprocessor Ini in Kernel

Iulien Frec

Introduction

Interruptions

IOAPIC

IPI

CPU init

Percpu variables

Conclusion

#### **LAPIC**

# That chip is mine



mikro -Multiprocessor Init in Kernel

Iulien Fred

Introduction

Interruptions
Old PIC

LAP

CPII init

Percpu variables

Conclusion

#### Local APIC

It collects local interrupts and provide a way to a CPU to accept interrupts.

#### Features:

- 2 pin per local APIC
- Memory mapped device
- Each CPU has a local APIC
- Timer, Performance monitoring, Thermal sensor
- Use ICC bus to speak with IOAPIC(s)
- Handle local interruptions
- Each Lapic has an unique ID

### All together





mikro -Multiprocessor Init in Kernel

ulien Frech

Introduction

Interruptions Old PIC

IPI

CPU init

Percpu variables

Conclusion



mikro -Multiprocessor Init in Kernel

Julien Freche

Introduction

Interruptions

LAPIC

CPU init

Percpu variabi

Conclusion

IPI



mikro -

### Multiprocessor Init

in Kernel

Introduction

Interruptions

CPII init

Percpu variables

Conclusion

### InterProcessor Interrupts

IPIs are interrupts issued by one processor and sent to another.

- Issued by the Local Apic
- Destination
  - All including self
  - All excluding self
  - Self
  - One processor
- Delivery Mode
  - Low priority
  - NMI
  - INIT, STARTUP

# **CPU** init

Multiprocessor Init in Kernel

Julien Frec

Introduction

Interruptions

CPU init

BootStrap Processor Application Processor INIT-SIPI-SIPI

Percpu variables

Conclusion

# (Gla)DOS: I am still alive



mikro -Multiprocessor Init

in Kernel

Introduction

Interruptions

CPU init

BootStrap Processor

Application Processor

Percou variables

Conclusion

For backward compatibility all modern PCs starts in the following state:

- Only one CPU active
- Real mode (16 bits)
- PIC available to handle interruptions

# BootStrap Processor



mikro -

Multiprocessor Init in Kernel

Julien Fred

Introduction

iterruptior

CPU init

BootStrap Pro

Application Processor

Percpu variables

Conclusion

**BootStrap Processor** 

### Dibs on the kernel



### Bootstrap Processor

This processor is chosen by the BIOS to start executing the bootloader code. It is the first to execute the kernel code and must wake up the other processors if needed.

#### Duties on mikro:

- Create a GDT and an IDT
- Init the paging
- Move the kernel code to its final location
- Parse MP Tables and/or MADT table
- Init IOAPIC(s) and its own LAPIC
- Perform some per-cpu init
- Wake every processors
- Start scheduling tasks

#### mikro -Multiprocessor Init in Kernel

Iulian Frack

Introduction

Interruptions

CPU init

BootStrap Pro

Application Processor INIT-SIPI-SIPI

Percpu variables

Conclusion



# Vintage table



### MultiProcessor Table

Created by Intel in 1997. Provides MP related informations to the OS.

#### **Informations:**

- Processors list
- Buses list
- IOAPICs list
- Interrupts list

#### Limitations:

- qemu: list only processors, not core nor threads.
- On some real hardware: list only cores, not threads.
- On other hardwares: not present

#### mikro -Multiprocessor Init

in Kernel

Julien Freche

Introduction

Interruptions

CPU init

BootStrap Pro

Application Processor INIT-SIPI-SIPI

Percpu variables

Conclusion

# Colona's favorite toy



mikro -Multiprocessor Init in Kernel

ılien Frech

Introduction

Interruptions

CPU init

Application Processor

Percou variables

Conclusion

### MADT: Multiple APIC Description Table

Part of the ACPI spec. Provides informations about an SMP system.

#### Informations:

- Processors, cores, threads
- IOAPICs, x2APIC list
- Interrupt Source Override

Kind of the easy part of the ACPI (no AML).

# **Application Processor**



mikro -

Multiprocessor Init in Kernel

Julien Frec

Introduction

nterruption

CPU init

BootStrap Processor

INIT CIDI CIDI

Percpu variables

Conclusion

### **Application Processor**

### Dibs on the kernel



#### mikro -

Multiprocessor Init in Kernel

Julien Fred

Introduction

Interruptions

CPU init

BootStrap Processor Application Processo

Percpu variables

Conclusion

### **Application Processor**

Processor in the halt state, waiting for a special IPI to start executing code. This processor is in real mode.

#### Duties on mikro:

- Jump to protected mode
- Create its own GDT
- Load the existing IDT
- Perform some per-cpu init
- Init its LAPIC
- Start scheduling tasks

### **INIT-SIPI-SIPI**



Multiprocessor Init in Kernel

Introduction

Interruptions

CPU init

Percpu variables

Conclusion

#### **INIT-SIPI-SIPI**

# Wake up!



mikro -Multiprocessor Init in Kernel

Julien Frec

Introduction

Interruptions

CPU init

BootStrap Processor Application Processor

Percpu variables

Conclusion

4□ > 4□ > 4□ > 4□ > 4□ > 4□ >

A processor will start executing code when receiving this sequence of IPIs.

- Copy some code in low memory
- Send an INIT IPI
- Send a StartUP IPI
- Send another StartUP IPI

# Some magic



I found this code on the internet.

```
.init:
  : set NMI handler
 mov dword [4*2], .boot
 xor eax, eax
 mov ebx, (11b shl 18) + (0 shl 15) + (1 shl 14)
                    + (0 \text{ shl } 11) + (100 \text{ shl } 8) + 2
  ; trigger interrupts in every processor
 mov [dword 0xFEE00300 + 16], eax
 mov [dword 0xFEE00300 + 00], ebx
  ret
align 16
.boot:
```

It works on qemu and some systems.

mikro -Multiprocessor Init in Kernel

ulien Frec

Introduction

Interruptions

PU init

BootStrap Processor Application Processor

Percpu variables

Conclusion

# Let me explain it



mikro -Multiprocessor Init in Kernel

ulien Frech

Introduction

Interruptions

CPII init

BootStrap Processor Application Processor

Percpu variables

Conclusion

Steps:

- Register boot function as handler into the IDT
- Set eax to zero
- Set IPI parameters into ebx:
  - NMI
  - Physical, Assert level, Edge trigger mode
  - All excluding self
- Send the IPI using the LAPIC

Kind of the quickest hack to wake every processors!

# What is wrong?



mikro -Multiprocessor Init in Kernel

ulien Frech

Introduction

Interruptions

CPU init

BootStrap Processor Application Processor

Percpu variables

Conclusion

#### Issues:

- Do not follow the spec..
- Assume that LAPIC is at 0xfee00000 (default addr but can be changed)
- Wake every processor (even processor disabled by the BIOS)

Kind of the quickest hack to wake every processors!

# Percpu variables



mikro -Multiprocessor Init in Kernel

Julien Frec

Introduction

Interruptions

CPU init

Percpu variables

Usage Implementation

Conclusion

# Percpu variables

# Usage



mikro -

Multiprocessor Init in Kernel

Julien Fred

Introductio

nterruption

CPU init

Percpu variables

Usage

Implementat

Using clang

Conclusion

Usage

# My precious variable



mikro -Multiprocessor Init in Kernel

ılien Frech

Introduction

Interruptions
CPII init

Percpu variables

Usage

Implementation Using clang

Conclusion

### Per-cpu variable

A cpu local variable. Each cpu can has a different value stored in this variable.

#### Considerations:

- Kind of TLS (Thread Local Storage) but for processors
- Accessed using macro on Linux
- Useful to change behavior of the code depending on the CPU

# Implementation



Multiprocessor Init in Kernel

Julien Frec

ntroduction

Interruptions

CPU init

Percpu variables

mplementatio

Using clang

Conclusion

Implementation

### mikro



mikro -

Introduction

Interruptions

CPII init

Percou variables

Conclusion

Multiprocessor Init in Kernel

How to implement it?

- Create a special section in binary file for these variables
- Allocate needed space by every processor for these variables
- Set a special entry in every GDT (at the same offset for every processor) but with a different values.
- Set FS to this special entry in the GDT
- Access variables relatively to FS

# Using clang



Multiprocessor Init in Kernel

Interruptions

CPU init

Percpu variables

Conclusion

Using clang

Usage Implementation

Conclusion

```
With clang you can do the following:
```

```
# define FS_RELATIVE address_space(257)
# define PCPU_S section(".cpuvar")
# define __percpu __attribute__((FS_RELATIVE, PCPU_S))
int __percpu myvar;

void do_something()
{
    myvar = 42;
}
```

After per-cpu init, you can access your variable normally without macros. Pretty clean.

### Conclusion



mikro -Multiprocessor Init in Kernel

Julien Frech

Introduction

Interruptions

CPU init

# **Conclusion**

### Almost done



mikro -Multiprocessor Init in Kernel

ılien Frech

Introduction

Interruptions
CPU init

Percou variables

Conclusio

#### To finish:

- To support SMP in kernel, think about it early
- Very interesting subject closely related to interruptions
- Think about a clever algorithm to dispatch interrupts
- All SMP systems has IOAPIC and LAPIC, try to play with it!

### Contacts



mikro -Multiprocessor Init in Kernel

Iulien Free

Introduction

interruptioi

CPU init

Percpu variables

Conclusion

#### Julien Freche

- julien.freche@lse.epita.fr
- @JulienFreche



mikro -Multiprocessor Init in Kernel

Introduction

Interruptions

CPU init

Percpu variables

Thank you for your attention