x86-64学习1-Introduction & Data Formats & Information Accessing & Arithmetic Logical Operation
Made by Mike_Zhang
Computer System 相关文章:
有符号二进制数表示方法 Signed binary number representation
浮点数二进制数表示方法 Floating point numbers representation
UltraFish Plus - 有符号二进制数转换器 Signed binary number convertor
UltraFish Plus - 浮点数表示方法转换器 Floating Point Numbers Representation Convertor
UltraFish Plus - 多进制整数转换器 Multiple Bases Unsigned Integer Convertor
Y86-64学习1-State & Instruction & Basic Encoding
Y86-64学习2-Y86-64 SEQ Stages
x86-64学习1-Introduction & Data Formats & Information Accessing & Arithmetic Logical Operation
x86-64学习2-Control
1 Introduction
Machine-Level Language:
- ISA: Instruction Set Architecture, defining:
- the Processor State;
- the format of the instructions;
- the effect of each instruction on the state;
- Use Virtual Addresses as memory addresses;
Compiler: transforming programs into the elementary instruction (machine code in binary)
Assembly-code: the code very close to the machine-code, but more readable, as it is not in binary format.
Processor Visible State behind C programmer:
- Program Counter: PC,
%rip
in x86-64, the address of the NEXT instruction to execute; - Register File: 16 location with 64-bit of each, stored addresses and integer data:
- track the critical parts of the state;
- store the temporary data, i.e., arguments, local variables, and return values;
- Conditional Code: store the state of recently executed arithmetic or logical instruction, can be used as the condition of control and date flow, e.g.,
if
andwhile
; - Set of Vector Register: store one or more integer or floating-point values.
Program Memory stores:
- program machine-code;
- operating system information;
- run-time stack for calls and returns;
- allocated by user.
2 Data Formats
word
: 16-bit (2-byte) data type;double word
: 32-bit (4-byte) data type;quad word
: 64-bit (8-byte) date type;char *
: pointer, 8-byte quad word.
data:image/s3,"s3://crabby-images/4587b/4587b5447817493d392571e7b3ab6c4a9ccc5ee0" alt=""
3 Accessing Information
data:image/s3,"s3://crabby-images/88519/88519b03dd8aab3ce1725aae066d1ebd45ce6766" alt=""
x86-64 CPU has a set of 16 64-bit general-purpose register, storing integer data and pointers(addresses):
- Start from 8086, 8 16-bit register (in RED box):
%ax
to%bp
; - Then to IA32, extended to 8 32-bit register (in BLUE box) :
%eax
to%ebp
; - Finally to x86-64, extended to 8 64-bit register (in BLACK box):
%rax
to%rbp
, as well as additional new 8 64-bit register (in DOT LINE box) :%r8
to%r15
;
Different registers have different functions:
%rsp
has a specific function: stack pointer, indicate the end position of run-time stack;- Other 15 registers have more flexible functions.
Instructions can operate on different data size in low-order of the 16 registers:
- 8-bit instruction: can access least significant 1 byte;
- 16-bit instruction: can access least significant 2 byte;
- 32-bit instruction: can access least significant 4 byte;
- 64-bit instruction: can access entire register;
3.1 Operand Specifiers
data:image/s3,"s3://crabby-images/a9619/a961983510c9455f80085782a96439b14633d818" alt=""
Three Types:
- Immediate:
- constant value, started with
$
, followed by a integer in standard C notation;
- constant value, started with
- Register:
- contents of a register, each bit length of instruction has its specific among of bits (e.g., 8-byte register:64 bits);
- Notation $r_a$ indicates the register $a$ and its value in reference $R[r_a]$ indexed by the register identifiers in an array $R$;
- Memory:
- Access memory location based on the computed address - effective address;
- $M_b[Addr]$: reference to the $b$-byte value in memory starting at address $Addr$;
- $Imm(r_b,r_i,s)$: the most general form:
- $Imm$: immediate offset;
- $r_b$: base register, 64-bit;
- $r_i$: index register, 64-bit;
- $s$: scale factor, must be 1,2,4, or 8;
- effective address $=Imm+R[r_b]+R[r_i]\cdot s$;
- The value is $M[Imm+R[r_b]+R[r_i]\cdot s]$;
- These complex addressing modes useful in Array and structure elements referencing.
[Example]
data:image/s3,"s3://crabby-images/9ca16/9ca166bff475466fa0af22a4dfbc8cfcca865ae3" alt=""
data:image/s3,"s3://crabby-images/20dd0/20dd07f408759cd90a9b39da9da0e7df53d635e5" alt=""
3.2 Data Movement Instructions
3.2.1 MOV
Instructions
Copy data from a source location to a destination location, without transformation.
data:image/s3,"s3://crabby-images/0048a/0048af4ef6b3a5b91c74bfeed717ea9ab8a19e5b" alt=""
Source(S):
- value of immediate;
- value in register;
- value in memory.
Destination(D):
- register;
- memory address.
Copy from a memory to another memory:
Can not directly copy, first load the memory value to a register, then write the register value to the destination memory.
movabsq
:
- S: 64-bit immediate value;
- D: must be a register.
For register operand:
- The size of the register must match the last character of the instruction(
b
,w
,l
,q
); - The
MOV
instruction will only update the specific byte indicated by the destination operand, - Except the
movl
instruction with the register destination, it will set high-order 4-byte to 0. - (For the convention in x86-64 from 64-bit to 32-bit to adopt)
[Example]
data:image/s3,"s3://crabby-images/4c704/4c7049194bb20f0d1c9069f40f2ff06c8e982281" alt=""
3.2.2 MOVZ
Instructions
Copy a small source value to a larger destination, fill the remaining bytes in destination with zeros.
data:image/s3,"s3://crabby-images/b97de/b97defdb2dc9693f9744f3edd9b8842a3555d6a3" alt=""
- S: register, memory;
- D: register;
- Last 2 character: size of source and size of destination;
- size of destination $\gt$ size of source.
- NO
movzlq
, implemented bymovl
: with 4-byte register as destination, it will fill the upper 4-byte with zeros.
3.2.3 MOVS
Instructions
Copy a small source value to a larger destination, fill the remaining bytes in destination by sign extension (copy the most significant bit).
data:image/s3,"s3://crabby-images/72903/72903a5c6ffe8b2e24bb6f3960a2a89c151a7383" alt=""
- S: register, memory;
- D: register;
- Last 2 character: size of source and size of destination;
- size of destination $\gt$ size of source.
cltq
: no operand,%eax
as source,%rax
as destination with sign-extended, as same asmovslq %eax, %rax
.
[Example]
data:image/s3,"s3://crabby-images/611da/611dabab4d1153ca7a44b5c3e2e9b7dad88b31b4" alt=""
data:image/s3,"s3://crabby-images/e1405/e1405da9e3e3461d47b7a9fe1d1583548c74752d" alt=""
[Example]
data:image/s3,"s3://crabby-images/60427/60427c8183158c705ffe55f994f073f0ad8d1b15" alt=""
data:image/s3,"s3://crabby-images/aec04/aec04eeacbce679631b963e606ca076745ccaee3" alt=""
3.3 Push & Pop Instructions
data:image/s3,"s3://crabby-images/abb08/abb08160244adeff80202b5d49c56df2b3052026" alt=""
data:image/s3,"s3://crabby-images/f1f79/f1f7932a06f53b7effe2fd5eb437561894864ad9" alt=""
pushq %rbp
:
1 |
|
popq %rax
:
1 |
|
Stack is contained in the same memory with program code and other program data, it can be accessed arbitrary positions within the stack, by using the standard memory addressing method;
e.g.,movq 8(%rsp), %rdx
, copy the second quad word in the stack to%rdx
.
4 Arithmetic & Logical Operations
data:image/s3,"s3://crabby-images/3ba35/3ba35f85f3a32834ec549fd459dd569becd9db93" alt=""
4.1 leap
Instructions
- load effective address instruction;
- read memory address to a register;
- NO access to the memory, just load the address;
- $\&S$: C address operator, like a pointer;
[Example]
data:image/s3,"s3://crabby-images/c3cfc/c3cfcf337673bb1094635eb094f7a75ad27cfebf" alt=""
data:image/s3,"s3://crabby-images/4c2ac/4c2ac4178b4aaacb73a3a26f54339ba03dbe9b19" alt=""
4.2 Unary & Binary Instructions
data:image/s3,"s3://crabby-images/f135a/f135af8f0b347bfb9ffc664c7a381b37a69214d7" alt=""
Unary Instructions:
- Operand can be register or memory location.
data:image/s3,"s3://crabby-images/2174f/2174f0f47ed968ec3e8e857a893ef19d37c6a4c2" alt=""
Binary Instruction:
- S: immediate value, register, memory location;
- D: register, memory location;
- S, D can NOT both be memory;
- Source operand first, Destination second;
Fun S, D
—>D = D fun S
subq %rax, %rdx
:%rdx = %rdx - %rax
(Subtract%rax
from%rax
)
[Example]
data:image/s3,"s3://crabby-images/3f128/3f1285c2a8c1438d2e243ac0e661eb692abc5385" alt=""
data:image/s3,"s3://crabby-images/8c3b7/8c3b751456c1efda9b6e76919efb402441c5e9b0" alt=""
4.3 Shift Instructions
data:image/s3,"s3://crabby-images/61316/6131642990c92de36ee2131c8f79cfe6fba73fd3" alt=""
Source and Destination can be register or memory location.
Shift amount (2 ways):
- immediate value:
k
- single-byte register
%cl
:- based data: w-bit (i.e. 8,16,32,64);
- shift amount: value of low-order m-bit of
%cl
, $2^m=w,m=\log_2w$;- e.g. 8-bit: lower 3-bit value of
%cl
; - 64-bit: lower 6-bit value of
%cl
;
- e.g. 8-bit: lower 3-bit value of
- Example:
%cl
= 0xFF = 1111 1111:salb
: 8-bit, shift lower 3-bit value = 111 = 7;salw
: 16-bit, shift lower 4-bit value = 1111 = 15;
Left Shift:
SAL
: arithmetic left shift;SHL
: logical left shift;- Same effect, fill right with zero;
Right Shift:
SAR
: arithmetic right shift, fill copy of the sign bits;SHR
: logical right shift, fill left with zeros;
[Example]
data:image/s3,"s3://crabby-images/e85e1/e85e12fdd8c27d093a6ff44af841db90ae72f0a3" alt=""
data:image/s3,"s3://crabby-images/14576/14576f866df17ed5db23da4587169210f592a661" alt=""
4.4 Special Arithmetic Instructions
data:image/s3,"s3://crabby-images/0e2ae/0e2aea891a4ed9df4c5f9acb2f573ecff4f75e4e" alt=""
Multiply:
- Different from 2-operand
imul
(generating 64-bit from two 64-bit operand); - It only has 1 operand, generating 128-bit from two 64-bit operand, a full multiply;
imulq
: signed (two’s complement) multiply;mulq
: unsigned multiply;- One argument must be in register
%rax
; - Other one is given as
S
; - Stored in high-order 64-bit:
%rdx
, and low-order 64-bit:%rax
.
[Example]
data:image/s3,"s3://crabby-images/e3e69/e3e696c23f72783981c23adbe9a7d12315638024" alt=""
data:image/s3,"s3://crabby-images/68e88/68e88095b0ebd5b1db47b183e01ccdf5bf01ba74" alt=""
Division:
- Single-operand instruction;
- Dividend: high-order 64-bit:
%rdx
, and low-order 64-bit:%rax
; - Divisor: given as
S
; - Store quotient in
%rax
; - Store remainder in
rdx%
cqto
:
- NO operand;
- Copy
%rax
and extends it to%rdx
; - Convert it to oct word.
参考
B. Randal, D. R. O’Hallaron, Computer systems : a programmer’s perspective, Third edition. Boston: Pearson, 2016.
写在最后
x86-64相关的知识会继续学习,继续更新.
最后,希望大家一起交流,分享,指出问题,谢谢!
原创文章,转载请标明出处
Made by Mike_Zhang
data:image/s3,"s3://crabby-images/405f6/405f6cd6c8256b0ebd9db9a1c060c4da751a9849" alt=""
data:image/s3,"s3://crabby-images/e3db6/e3db6c6063d2beac65cc865909bf8c870f9dc6c7" alt=""