Y86-64学习1-State & Instruction & Basic Encoding
Made by Mike_Zhang
Computer System 相关文章:
有符号二进制数表示方法 Signed binary number representation
浮点数二进制数表示方法 Floating point numbers representation
UltraFish Plus - 有符号二进制数转换器 Signed binary number convertor
UltraFish Plus - 浮点数表示方法转换器 Floating Point Numbers Representation Convertor
UltraFish Plus - 多进制整数转换器 Multiple Bases Unsigned Integer Convertor
Y86-64学习1-State & Instruction & Basic Encoding
Y86-64学习2-Y86-64 SEQ Stages
x86-64学习1-Introduction & Data Formats & Information Accessing & Arithmetic Logical Operation
x86-64学习2-Control
1 Accessible & Modifiable State
Y86-64 programmer-visible state
The programmer can access and modify these processor state.
Similar to x86-64, but more simpler and less compact.
data:image/s3,"s3://crabby-images/d9a21/d9a21531aca629e632f2518ae47f81e57e707a3a" alt=""
Comparing with the register part of x86-64:
data:image/s3,"s3://crabby-images/083b2/083b2d219d0aab35395fe2fbdbf76928fff66a59" alt=""
1.1 Y86-64 Program Registers
- 15 Program Resister;
- No
%r15
, to simplify the encoding; - 64-bit word, 8 words;
%rsp
for Stack Pointer, NO fixed meaning or value for others.
data:image/s3,"s3://crabby-images/bb4ea/bb4ea339c919e9c6ef9ffdbd2c09f08de216dc26" alt=""
1.2 Condition Codes
- CC: Condition Code;
- 3 single-bit codes;
- ZF, SF, OF;
- ZF: Zero Flag. The most recent operation yielded zero.
- SF: Sign Flag. The most recent operation yielded a negative value.
- OF: Overflow Flag. The most recent operation caused a two’s-complement overflow—either negative or positive.
1.3 Program Counter
- PC: Program Counter;
- Store the address of currently executing instruction.
1.4 Memory
- Virtual memory;
- In the Operand forms, only represented in base and displacement, NO index and scale in x86-64.
1.5 Program State
- Stat: Program State;
- The overall state of program execution;
- Normal operation or exception.
2 Y86-64 Instructions
data:image/s3,"s3://crabby-images/451ff/451ff20755cd8c5abac8e4edb6effa1215257be7" alt=""
- A subset of x86-64 instruction set;
- 8-byte integer operations;
- fewer address modes;
- smaller set of operations;
- Each instruction set including:
- 1-byte instruction specifier (e.g.,
0|0
forhalt
), including(op|fn
):- 4-bit operation code(
op
) and, - 4-bit function code(
fn
) to specify a particular function ;
- 4-bit operation code(
- (possibly) 1-byte register specifier (e.g.,
rA|rB
,F|rB
); - (possibly) 8-byte constant word (e.g.,
V
,D
,Dest
);
- 1-byte instruction specifier (e.g.,
- 4 types of instruction: 1-byte, 2-byte, 9-byte, and 10-byte instruction;
- 1-byte:
halt
,nop
,ret
(only instruction specifier); - 2-byte:
rrmovq rA, rB
,OPq rA, rB
,cmovXX rA, rB
,pushq rA
,popq rA
(only instruction specifier + register specifier); - 9-byte:
jXX Dest
,call Dest
(only instruction specifier + constant word); - 10-byte:
irmovq V, D(rB)
,rmmovq rA, D(rB)
,mrmovq D(rB), rA
(instruction specifier + register specifier + constant word);
- 1-byte:
- Encoded in hexadecimal value.
2.1 movq
Instructions
data:image/s3,"s3://crabby-images/51501/51501781f811c5220fb81ec956f6f3ee4377a4cd" alt=""
- subset of x86-64 movq instruction set;
- Indicating the movement: source $\to$ destination;
- Source: first character - immediate(
i
), register(r
), memory(m
); - Destination: second character - register(
r
), memory(m
);
- Source: first character - immediate(
- NO memory(
m
) location $\rightarrow$ another memory(m
) location; - NO immediate(
i
) data $\rightarrow$ memory(m
);
The movement is from the first argument to the second argument, usually
rA(V)
$\to$rB
,
exceptmrmovq D(rB), rA
, which isrB
$\to$rA
The immediate value(
V
) and displacement(D
) is 8-byte constant word.memory reference: only base and displacement(e.g.,
D(rB)
,rB
for base,D
for displacement), NO second index register or scale;
data:image/s3,"s3://crabby-images/1ebf6/1ebf60af7137a49f6bcb242d3d927cbb6e94489d" alt=""
2.2 OPq
Integer Operation Instructions
data:image/s3,"s3://crabby-images/d2bd6/d2bd66f79b08b9ba0f81472321a7daa2b62e825a" alt=""
data:image/s3,"s3://crabby-images/7e5cd/7e5cd88ca21119283708af80d811520f9f4d136c" alt=""
- subset of x86-64 Integer arithmetic operations;
- 2-byte instruction;
- 4 instructions:
addq
,subq
,andq
,xorq
; - Only operate on register data, NOT on memory data;
- Sets 3 conditional code
ZF
,SF
, andOF
. - Function code for
fn
:
data:image/s3,"s3://crabby-images/651a1/651a1b52b009b2736e3aeb495991a31fcaa5a6e1" alt=""
2.3 jXX
Jump Instructions
data:image/s3,"s3://crabby-images/9feb2/9feb2d949983a556040c14370a37681ab2d6d04b" alt=""
data:image/s3,"s3://crabby-images/1f99f/1f99f9b50e3a8076239e72c299c95fdcc513b8e5" alt=""
- subset of x86-64 Jump instructions;
- 9-byte instruction;
- 7 instructions:
jmp
,je
,jne
,jg
,jge
,jl
,jle
; - according to the conditional codes(CC).
- Function code for
fn
:
data:image/s3,"s3://crabby-images/56f56/56f56bdd3cc6e0471f9456f91d5d55a08e5ab9ef" alt=""
2.4 cmovXX
Conditional Move Instructions
data:image/s3,"s3://crabby-images/29070/29070c5fa1411fbdbf5bfe5e081e425ca625c4e1" alt=""
data:image/s3,"s3://crabby-images/00550/005506944aba8ec7dcf5ba11fbf5d60d1d4321d7" alt=""
- subset of x86-64 Conditional move instructions;
- 2-byte instruction;
- 7 instructions:
cmove
,cmovne
,cmovg
,cmovge
,cmovl
,cmovle
; - same format with register-register move -
rrmovq
; - move occurs only if condition satisfied.
- Function code for
fn
:
data:image/s3,"s3://crabby-images/498fb/498fb76c3d81b93275c6a96515e4acad5754731d" alt=""
2.5 call
Instructions
data:image/s3,"s3://crabby-images/73474/734742ca37b67a02932cbc85c02a211740c1a19a" alt=""
- 9-byte instruction;
- First, push the return address into the stack, the return address refers to the address of the instruction immediately after the
call
instruction; - Second, jump to the
Dest
address by setting thePC
to the destination address;
ret
instruction:
data:image/s3,"s3://crabby-images/c6688/c6688e31115c55a7f7c3e90614fe7f43366fd7be" alt=""
- 1-byte instruction;
- the instruction pop the address from stack, then set the
PC
to that address.
2.6 pushq
& popq
Instructions
data:image/s3,"s3://crabby-images/c3c4c/c3c4ca050aaac79e77bf166c372bd1018be4c249" alt=""
- 2-byte instruction;
- as same as in x86-64 Push and pop instructions:
data:image/s3,"s3://crabby-images/f73e6/f73e6083e8e50d1ac423117b19bc0fff643feddd" alt=""
2.7 halt
Instructions
data:image/s3,"s3://crabby-images/9e7e2/9e7e29bb995c9e88d7c2a52a661971e56991b347" alt=""
- Stops instruction execution;
- 1-byte instruction.
2.8 nop
Instructions
data:image/s3,"s3://crabby-images/5256e/5256ee962312c518c39973792e50525b9d0b674a" alt=""
- Do nothing;
- 1-byte instruction.
3 Encoding
3.1 Instruction Specifier
Having been mentioned in Section 2.
Every instruction has a type specifier, which is the first byte.
It can be separated into two 4-bit parts, operation code and function code.
- operation codes range from 0 to 0xB.
- function code have special values in integer operation, branch, and move instructions; 0 for rest instructions:
data:image/s3,"s3://crabby-images/ba9e6/ba9e6b13dedd79099add2179efb9c2c2343d9517" alt=""
3.2 Register Identifier
In Y86-64, some instruction has register operands, such as rrmovq
, which associated with the Program Register, who also need to be encoded.
- Each Program Register has its Register Identifier, ranging from 0 to 0xE.
data:image/s3,"s3://crabby-images/4cfd8/4cfd8d30bc4907f7787a39ca7d25290deffb413f" alt=""
- The 0xF register will not be accessed.
- Some instruction do not require register specifier;
- Some instruction require only one register specifier, such as
irmovq
, which need to set another register specifier to 0xF for easy implementation.
3.3 Constant Word Encoding
3 types of 8-byte constant word:
- immediate data(
V
); - displacement for address specifier(
D
); - destination for address specifier(
Dest
);
For the destination address in branch and call instructions, the destination is the absolute address, NOT the PC-related address in x86-64.
- all constant integer is encoded in little-endian encoding, which means every byte should be reversed when encoding.
little-endian means the LSP(right-most) byte appears first;
For 0x 0A 0B 0C 0D $\to$ 0D 0C 0B 0A (reversed)
3.4 Example
To encode:1
2
3
4
5
6
7.pos 0x100 # Start code at address 0x100
irmovq $15,%rbx
rrmovq %rbx,%rcx
loop:
rmmovq %rcx,-3(%rbx)
addq %rbx,%rcx
jmp loop
Practice Problem 4.1 (CS: APP)
Solution steps:
1 |
|
the start position of each instruction is depended on the length of the previous one.
1 |
|
3|0|F|rB|V
:
rB
: %rbx
$\to$ 0x3
V
: $15
$\to$ 0x 00 00 00 00 00 00 00 0f $\to$ 0x 0f 00 00 00 00 00 00 00
1 |
|
1 |
|
2|0|rA|rB
:
rA
: %rbx
$\to$ 3
rB
: %rcx
$\to$ 1
1 |
|
1 |
|
4|0|rA|rB|D
rA
: %rcx
$\to$ 1
rB
: %rbx
$\to$ 3
D
: -3 $\to$ 0000 0000 … 0011 $\to$ 1111 1111 … 1101 (2’s complement) $\to$ 0x ff ff ff ff ff ff ff fd $\to$ fd ff ff ff ff ff ff ff
1 |
|
1 |
|
6|0|rA|rB
rA
: %rbx
$\to$ 3
rB
: %rcx
$\to$ 1
1 |
|
1 |
|
7|0|Dest
Dest
: loop
$\to$ 0x10a $\to$ 0x 00 00 00 00 00 00 01 0c $\to$ 0c 01 00 00 00 00 00 00
1 |
|
Answer:
1 |
|
参考
B. Randal, D. R. O’Hallaron, Computer systems : a programmer’s perspective, Third edition. Boston: Pearson, 2016.
写在最后
Y86-64相关的知识会继续学习,继续更新.
最后,希望大家一起交流,分享,指出问题,谢谢!
原创文章,转载请标明出处
Made by Mike_Zhang
data:image/s3,"s3://crabby-images/405f6/405f6cd6c8256b0ebd9db9a1c060c4da751a9849" alt=""
data:image/s3,"s3://crabby-images/e3db6/e3db6c6063d2beac65cc865909bf8c870f9dc6c7" alt=""