Assembler




System Software

• components
     – translator
         • assembler
         • compiler
         • interpreter
     – system manager
         • operating system
     – other utilities
         • loader
         • linker
         • DBMS, editor, debugger, ...
•   purpose of this course
     – understand how to build system software
     – understand how these components work




Issues in System Software

•   not many open issues remain in this area
     – mature area
•   advanced architectures complicate system software
     – superscalar CPU
     – memory model
     – multiprocessor
•   new applications
     – embedded systems
     – mobile/ubiquitous computing




Assembler Overview

•   functions
     – translate programs written in assembly language to machine code
          • mnemonic code to machine code
          • symbols to addresses
     – handles
          • constants
          • literals
          • addressing
•   32 bit constant or address
•   32 bit offset




Assembler Overview (cont’d)
•   pass 1: loop until the end of the program
     1. read in a line of assembly code
     2. assign an address to this line
           • increment N (word addressing or byte addressing)
     3. save address values assigned to labels
           • in symbol tables
     4. process assembler directives
           • constant declaration
           • space reservation
•   pass 2: same loop
     1. read in a line of code
     2. translate op code
           using op code table
     3. change labels to addresses
           using the symbol table
     4. process assembler directives
     5. produce object program

Data Structures for Assembler

                               add $t0, $t1, $t2   000000 01001 01010 01000 00000 100000

•   op code table
     – looked up for the translation of mnemonic code
         • key: mnemonic code
         • result: bits
     – hashing is usually used
         • once prepared, the table is not changed
         • efficient lookup is desired
         • since mnemonic code is predefined, the hashing function can
            be tuned a priori
     – the table may have the instruction format and length
         • to decide where to put op code bits, operands bits, offset bits
         • for variable instruction size
         • used to calculate the address
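
The lookup described above can be sketched in Python. This is a minimal illustration, not a full assembler: `OPTAB`, `REGS`, `encode_rtype` and `fields` are hypothetical names, the table holds only two R-type entries, and a Python dict stands in for the hash table.

```python
# Hypothetical, tiny op code table: mnemonic -> (format, opcode bits, funct bits).
OPTAB = {
    "add": ("R", 0b000000, 0b100000),
    "sub": ("R", 0b000000, 0b100010),
}

# Register numbers for a few MIPS temporaries ($t0..$t3 are registers 8..11).
REGS = {"$t0": 8, "$t1": 9, "$t2": 10, "$t3": 11}

def encode_rtype(line):
    """Encode 'op rd, rs, rt' as a 32-bit R-type word."""
    mnemonic, operands = line.split(None, 1)
    rd, rs, rt = (REGS[r.strip()] for r in operands.split(","))
    fmt, opcode, funct = OPTAB[mnemonic]
    if fmt != "R":
        raise ValueError(f"{mnemonic} is not an R-type instruction")
    # Field layout: opcode(6) rs(5) rt(5) rd(5) shamt(5) funct(6); shamt is 0 here.
    return (opcode << 26) | (rs << 21) | (rt << 16) | (rd << 11) | funct

def fields(word):
    """Format a word as the six R-type bit fields, separated by spaces."""
    bits = format(word, "032b")
    cuts = [0, 6, 11, 16, 21, 26, 32]
    return " ".join(bits[a:b] for a, b in zip(cuts, cuts[1:]))

print(fields(encode_rtype("add $t0, $t1, $t2")))
```

The printed fields are 000000 01001 01010 01000 00000 100000, the same bits as the `add $t0, $t1, $t2` example on this slide. The stored format is what tells the encoder where each operand goes.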



Data Structures for Assembler (cont’d)
•   symbol table
     – stored and looked up to assign addresses to labels
         • efficient insertion and retrieval is needed
         • deletion does not occur
     – difficulties in hashing
         • non-random keys
     – problem
         • the size varies widely

        .text
        .globl main
main:
        la      $t0, array
        lw      $t1, count
        lw      $t2, ($t0)
loop:
        lw   $t3, 4($t0)
        ble  $t3, $t2, loop2
        move $t2, $t3

loop2: add $t1, $t1, -1
       add $t0, $t0, 4
       bnez $t1, loop
        …
        ….
        .data
array:          .word 3, 5, 5, 1, 6, 7, …..
count:          .word 15
string1:        .asciiz "nmax = "




Symbol Table Construction


        .text
        .globl main
main:
        la      $t0, array
        lw      $t1, count
        lw      $t2, ($t0)
loop:
        lw   $t3, 4($t0)
        ble  $t3, $t2, loop2
        move $t2, $t3

loop2: add $t1, $t1, -1
       add $t0, $t0, 4
       bnez $t1, loop
        …
        ….
        .data
array:          .word 3, 5, 5, 1, 6, 7, …..
count:          .word 15
string1:        .asciiz "nmax = "
bad:            .word 7

symbol name       value
main              0
loop              12
loop2             24
…
array             408
count             468
string1           472
bad               478


Assembler Algorithm: pass1
begin
   if starting address is given
       LOCCTR = starting address;
   else
       LOCCTR = 0;
   while OPCODE != END do                ;; or EOF
       begin
       read a line from the code
       if there is a label
              if this label is in SYMTAB, then error
              else insert (label, LOCCTR) into SYMTAB
       search OPTAB for the op code
       if found
              LOCCTR += N ;; N is the length of this instruction (4 for MIPS)
       else if this is an assembly directive
              update LOCCTR as directed
       else error
       write line to intermediate file
       end
   program size = LOCCTR - starting address;
end
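
A rough Python sketch of the pass 1 loop above, under stated assumptions: every instruction, including pseudo-instructions such as `la`, is counted as one 4-byte word (as the symbol table slide does), only `.word` and `.space` reserve space, and the intermediate file is omitted. The names `pass1`, `OPTAB` and `SOURCE` are illustrative, not from the slides.

```python
def pass1(lines, optab, start=0, n=4):
    """Build SYMTAB from source lines; return (symtab, program_size)."""
    symtab = {}
    locctr = start
    for raw in lines:
        line = raw.split("#", 1)[0].strip()    # drop comments and blank lines
        if not line:
            continue
        if ":" in line:                         # a label precedes the statement
            label, line = (part.strip() for part in line.split(":", 1))
            if label in symtab:
                raise ValueError(f"duplicate label: {label}")
            symtab[label] = locctr
        if not line:                            # label alone on the line
            continue
        parts = line.split(None, 1)
        op = parts[0]
        arg = parts[1] if len(parts) > 1 else ""
        if op in optab:
            locctr += n                         # fixed-length instruction
        elif op == ".word":
            locctr += 4 * len(arg.split(","))   # one word per listed value
        elif op == ".space":
            locctr += int(arg)
        elif op in (".text", ".data", ".globl"):
            pass                                # reserves nothing in this sketch
        else:
            raise ValueError(f"unknown op code: {op}")
    return symtab, locctr - start

OPTAB = {"la", "lw", "ble", "move", "add", "bnez"}
SOURCE = """
        .text
        .globl main
main:
        la   $t0, array
        lw   $t1, count
        lw   $t2, ($t0)
loop:
        lw   $t3, 4($t0)
        ble  $t3, $t2, loop2
        move $t2, $t3
loop2:  add  $t1, $t1, -1
        add  $t0, $t0, 4
        bnez $t1, loop
        .data
array:  .word 3, 5, 5, 1, 6, 7
count:  .word 15
""".splitlines()

symtab, size = pass1(SOURCE, OPTAB)
print(symtab, size)
```

For this shortened program the sketch assigns main = 0, loop = 12 and loop2 = 24, matching the symbol table slide. The data addresses differ from the slide because the elided instructions and array elements are left out here.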



Assembler Algorithm: pass2
begin
   read a line;
   if op code = START then ;; .globl xxx for MIPS
       write header record;
   while op code != END do ;; or EOF
       begin
       search OPTAB for the op code;
       if found
              if the operand is a symbol then
                      replace it with an address using SYMTAB;
              assemble the object code;    ;; e.g. add $t0, $t1, $t2 => 000000 01001 01010 01000 00000 100000
       else if it is a defined directive
              convert it to object code;
       add object code to the text;
       read next line;
       end
   write End record to the text;
   output text;
end




Program Relocation
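[Figure: the same program shown at two load addresses]
     – program is loaded at 0: it occupies 0 to 1076, and "jump to 1004" reaches the intended instruction at 1004
     – program is loaded at 5000: it occupies 5000 to 6076, but the instruction still reads "jump to 1004"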

•   motivations for relocation
     – a program may consist of several pieces of code that are assembled
       independently
     – when a program is assembled, it is impossible to know the exact location
       where the program starts




Program Relocation (cont’d)

•   distances from the origin of a program do not change
     – make the address relative to the origin
     – provides loader with information about
          • which address needs fixing
          • length of address field
     – the loader changes those addresses to
          • distance + start address of a program
     – only absolute addresses need to be changed
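
The loader's fix-up step can be sketched as follows. The instruction layout (8-bit op code, 24-bit address field), the `JUMP` op code and the shape of a modification record as `(word_index, field_bits)` are all assumptions made for this illustration.

```python
def relocate(words, mod_records, start):
    """Return a copy of the program with relocatable fields fixed up.

    words       -- object program as a list of integers, one per word
    mod_records -- list of (word_index, field_bits) that need fixing
    start       -- address where the loader actually places the program
    """
    loaded = list(words)
    for index, bits in mod_records:
        mask = (1 << bits) - 1
        field = loaded[index] & mask            # distance from the program origin
        fixed = (field + start) & mask          # distance + start address
        loaded[index] = (loaded[index] & ~mask) | fixed
    return loaded

# Toy encoding (an assumption): top 8 bits = op code, low 24 bits = address.
JUMP = 0x01
program = [(JUMP << 24) | 1004, 0x00000000]
loaded = relocate(program, [(0, 24)], start=5000)
print(loaded[0] & 0xFFFFFF)
```

With a start address of 5000 the address field becomes 6004, and with a start address of 0 the program is unchanged. Words without a modification record are never touched.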




Literals

•   usage
     – encoded as an operand (similar to the immediate in MIPS, but different)
          • load $7, =X’0A7F’
     – simple way to declare a constant
     – assembler does
          • declare a constant with a label
          • use the label to use the value
•   comparison with immediate
     – a literal is handled by the assembler
          • an immediate is data the machine recognizes directly
     – a full word can be used for a literal
          • immediate: full word minus the bits used for the op code and registers
     – literal values are fetched from data memory, which is slower
          • immediate data is within the instruction itself




Literals (cont’d)

•   literal pool
      – assembler collects all the literals into one or more literal pools
      – default location is at the end of the program
           • for better code reading
      – programmer can declare a place (LTORG)
           • to use PC-relative addressing
           • to keep data close to instruction
•   optimization
      – make one literal for the same value
           • compare character string or value?
               – x’454F46’ = c’EOF’
           • value comparison needs evaluation
•   literal table
      – name(label), operand value, operand length, address in the table
      – both name and value are used as keys


Literal Handling Algorithm

pass 1
   at a recognition of a literal
       search LITTAB by name
       if found but different value, error
       else if the same value, no action
       else if not found insert a new literal (no address yet)
   if the code is LTORG or END
       allocate each literal assigning an address

pass 2
   replace each literal with the address in the LITTAB
   if these addresses are absolute,
       prepare modification for relocation
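
One way to picture the algorithm above is the following sketch. `LiteralTable` and its method names are hypothetical, literals are keyed by their source text, and the value-based merging of equal literals (x'454F46' versus c'EOF') is left out.

```python
class LiteralTable:
    """LITTAB sketch: literals wait without an address until LTORG or END."""

    def __init__(self):
        self.entries = {}   # name -> {"value": bytes, "address": int or None}

    def see(self, name, value):
        """Pass 1: called each time a literal operand is recognized."""
        entry = self.entries.get(name)
        if entry is None:
            self.entries[name] = {"value": value, "address": None}
        elif entry["value"] != value:
            raise ValueError(f"literal {name} found with a different value")
        # same name and same value: no action

    def place_pool(self, locctr):
        """Pass 1: at LTORG or END, assign addresses; return the new LOCCTR."""
        for entry in self.entries.values():
            if entry["address"] is None:
                entry["address"] = locctr
                locctr += len(entry["value"])
        return locctr

    def address_of(self, name):
        """Pass 2: the address that replaces the literal operand."""
        return self.entries[name]["address"]

littab = LiteralTable()
littab.see("=X'0A7F'", bytes.fromhex("0A7F"))
littab.see("=C'EOF'", b"EOF")
littab.see("=X'0A7F'", bytes.fromhex("0A7F"))   # repeated use: no new entry
end = littab.place_pool(100)
print(littab.address_of("=X'0A7F'"), littab.address_of("=C'EOF'"), end)
```

Placing the pool at location 100 gives the 2-byte literal address 100 and the 3-byte literal address 102, and the location counter moves to 105. A later LTORG would only place literals seen since the previous pool, because placed entries already have an address.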




Symbol Defining Statement

•   MAXLEN            EQU 4096
     – makes program structure better
     – easier to modify a single location
     – easier to remember than numbers
     – registers can be given meaningful names
     – (maxlen = 4096) in MIPS
•   assembler
     – searches SYMTAB and replaces the symbol with the value in the table
     – resulting object code is the same as using the value instead of the symbol
     – remember that with 2 passes there is a restriction
               X    EQU Y
               Y    EQU 100
          • X cannot be defined in pass 1, because Y is not yet defined when X is read




Expressions

BUFFER: .space 4096            ; reserve 4096 bytes here
BUFEND:            ; BUFEND is set to the current location
(MAXLEN = BUFEND - BUFFER) ; calculate the size of the buffer



•   allows simple arithmetic operations in symbol definition
•   operands may have relative values for relocation
     – relative values should be modified by the loader later
          • we need to know which is relative
     – symbol table needs a type field to discern absolute symbols from relative
        symbols




Expression Rules


•   basic
     – constant is absolute
     – address is relative
•   using expressions
     – expression with absolute arguments is absolute
     – multiplication and division are allowed only on absolute terms, so such an expression is absolute
     – relative_1 - relative_2 is absolute
          • dependencies on starting address are canceled out
     – all the other expressions having relative terms are neither relative nor
        absolute (error?)
          • constant - relative
          • relative_1 + relative_2
          • 3 x relative_1
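
These rules can be checked by counting relative terms. The sketch below assumes sums and differences only, and it also assumes the usual convention, not stated on the slide, that exactly one unpaired positive relative term (for example relative + constant) makes the expression relative. `classify` is a hypothetical helper.

```python
def classify(terms):
    """Classify an expression given as a list of (sign, kind) terms.

    sign is +1 or -1; kind is "abs" (constant) or "rel" (address).
    Returns "absolute", "relative", or "error".
    """
    net = sum(sign for sign, kind in terms if kind == "rel")
    if net == 0:
        return "absolute"    # dependencies on the starting address cancel out
    if net == 1:
        return "relative"    # assumption: one unpaired positive relative term
    return "error"           # neither relative nor absolute

print(classify([(+1, "rel"), (-1, "rel")]))   # relative_1 - relative_2
print(classify([(+1, "abs"), (-1, "rel")]))   # constant - relative
print(classify([(+1, "rel"), (+1, "rel")]))   # relative_1 + relative_2
```

The three calls print absolute, error and error. Writing 3 x relative_1 as three positive relative terms gives a net count of 3, so it is rejected as well.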




Program Blocks

[Figure: assembly gathers the pieces of each block]
     – source order:      block 0, block 1, block 2, block 0, block 1, block 2
     – object code order: block 0, block 1, block 2




Program Blocks (cont’d)

•   motivation
     – programmer’s view may be different from machine’s view
          • affects only efficiency not functionality
     – addressing can be simplified
          • large data area can be moved to the end of code while source code places
            it close to the instructions that use this data
•   data structure and algorithm
     – block table (name, block number, address, length)
     – pass 1
          • maintain separate LOCCTR for each block
              – each label is assigned address relative to the start of the block that contains it
         • SYMTAB stores block number for each symbol
         • store starting address of each block in block table
     – pass 2
         • assign address to each symbol by adding the relative address to the block
           starting address
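
The two passes over program blocks might look like this in Python. The statement format `(block_name, label_or_None, size_in_bytes)` and the function name are assumptions, and blocks are laid out in order of first use.

```python
def assign_block_addresses(statements, start=0):
    """statements: list of (block_name, label_or_None, size_in_bytes)."""
    locctr = {}        # block name -> current LOCCTR within that block
    symtab = {}        # label -> (block name, address relative to block start)
    for block, label, size in statements:           # pass 1
        offset = locctr.setdefault(block, 0)
        if label is not None:
            symtab[label] = (block, offset)
        locctr[block] = offset + size
    block_start = {}   # block table: name -> starting address
    address = start
    for block, length in locctr.items():            # blocks in first-use order
        block_start[block] = address
        address += length
    # pass 2: final address = block starting address + relative address
    return {label: block_start[b] + off for label, (b, off) in symtab.items()}

statements = [
    ("code", "main", 8),
    ("data", "buffer", 4096),
    ("code", "loop", 12),
    ("data", "count", 4),
]
print(assign_block_addresses(statements))
```

Here the code block is 20 bytes long, so the data block starts at 20: `buffer` lands at 20 and `count` at 4116, while `loop` stays at 8 even though the large buffer appears before it in the source.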

Control Sections

•   a control section is a part of a program that can be assembled independently of
    other parts
     – a large program can be divided into many control sections
     – each control section can be developed independently
     – each control section can be modified independently
•   symbols defined in other control sections
     – called external
     – assembler records those symbols for the loader and linker
     – loader and linker resolve the values of external symbols




Control Sections (cont’d)

•   a table prepared by assembler
     – define record
          • name of symbol defined in this control section
          • relative address of the symbol
     – refer record
          • name of external symbols
     – modification record
          • starting address of field to be modified
          • length of this field
          • name of external symbol
•   loader
     – for every external symbol
          • find the relative address from the define record
          • add the starting address of the control section where the symbol is defined
          • modify the field


One-Pass Assembler

•   problem
     – forward reference: reference to symbols that are not defined yet
•   why do we need one-pass assembler?
     – fast
          • useful for program development and testing
          • university computing environment
•   load-and-go assembler
     – writes the object code to memory, not to a disk file
     – since it is in memory, it is easy to modify a part of the object code




One-Pass Assembler (cont’d)

•   one-pass assembler for load-and-go
     – stores undefined symbols in the SYMTAB with the address of the field that
        references this symbol
     – when the symbol is defined later, look up the SYMTAB and modify the field
        with correct address
          • there may be many places to be modified
•   what if object code is written on disk?
     – bring back the text to memory
          • efficiency of one-pass assembler cannot be justified
     – make the loader modify the address at loading time
          • modification record again
•   optimization
     – require all data declarations to be placed at the beginning of the program
          • reduces reference resolution
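
The load-and-go handling of forward references can be sketched as below. This is a toy model under stated assumptions: each statement occupies one memory word that simply holds its operand's address, and the list of fields waiting on an undefined symbol is kept in a separate `pending` dict rather than inside SYMTAB.

```python
def one_pass(statements):
    """statements: list of (label_or_None, operand_symbol_or_None).

    Each statement occupies one memory word that holds the operand address.
    """
    memory = []      # object code, built directly in memory (load-and-go)
    symtab = {}      # defined symbols: name -> address
    pending = {}     # undefined symbols: name -> word indexes to be fixed
    for label, operand in statements:
        address = len(memory)
        if label is not None:
            symtab[label] = address
            for index in pending.pop(label, []):    # patch earlier references
                memory[index] = address
        if operand is None:
            memory.append(0)
        elif operand in symtab:
            memory.append(symtab[operand])
        else:
            memory.append(None)                     # placeholder until defined
            pending.setdefault(operand, []).append(address)
    if pending:
        raise ValueError(f"undefined symbols: {sorted(pending)}")
    return memory, symtab

memory, symtab = one_pass([
    (None, "done"),     # forward reference
    ("top", "done"),    # another forward reference
    (None, "top"),      # backward reference
    ("done", None),
])
print(memory)
```

The result is `[3, 3, 1, 0]`: both forward references to `done` are patched to address 3 once the label is seen, which shows why there may be many places to modify for a single symbol.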




Multi-Pass Assembler
  •   supports forward references even though they are bad for program readability

source
   1. (A = B/2)
   2. (B = C-D)
      ....
   8. C .....
   9. D .....

at 1, store in a table two tuples
   (A, 1, B/2, 0)
      1: one symbol is missing
      0: no other symbol depends on A
   (B, *, , &LB)
      *: don't know how many symbols are missing yet
      LB: list of symbols that depend on B (now, there is only A in this list)
at 2,
   insert (C,*, ,&LC), (D,*, ,&LD)
      LC and LD contain only B
   modify (B,*, ,&LB) as (B,2,C-D,&LB)
after 8
   from LC, B is found
   change 2 to 1 in the B tuple, meaning one symbol remains to be defined
after 9
   from LD, B is found
   now evaluate B with defined C, D values
since B is done
   from LB, A is found
   now A can be evaluated
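
The bookkeeping in this example (a missing-symbol count per definition plus a dependents list per symbol) can be sketched in Python. The data layout and the name `resolve` are assumptions, the values C = 10 and D = 4 are made up for the demonstration, and B/2 is taken to be integer division.

```python
def resolve(definitions, values):
    """definitions: name -> (operand names, function of the operand values).
    values: symbols that receive a plain value, e.g. {"C": 10, "D": 4}.
    """
    missing = {}       # name -> number of operands still undefined
    dependents = {}    # name -> symbols whose definition waits on it (LB, LC, LD)
    for name, (operands, _) in definitions.items():
        missing[name] = len(operands)
        for operand in operands:
            dependents.setdefault(operand, []).append(name)
    result = {}
    ready = list(values.items())
    while ready:
        name, value = ready.pop()
        result[name] = value
        for waiting in dependents.get(name, []):
            missing[waiting] -= 1
            if missing[waiting] == 0:               # all operands known now
                operands, func = definitions[waiting]
                ready.append((waiting, func(*(result[o] for o in operands))))
    return result

definitions = {
    "A": (["B"], lambda b: b // 2),          # 1. A = B/2
    "B": (["C", "D"], lambda c, d: c - d),   # 2. B = C-D
}
result = resolve(definitions, {"C": 10, "D": 4})
print(result["B"], result["A"])
```

With these sample values B evaluates to 6 and then A to 3. A symbol whose operands never become defined simply stays out of `result`, which is where an assembler would report an undefined symbol.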

Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Assembler

  • 2. System Software
      • components
          – translator
              • assembler
              • compiler
              • interpreter
          – system manager
              • operating system
          – other utilities
              • loader
              • linker
              • DBMS, editor, debugger, ...
      • purpose of this course
          – understand how to build system software
          – understand how these components work
  • 3. Issues in System Software
      • not many open issues in this area
          – mature area
      • advanced architectures complicate system software
          – superscalar CPU
          – memory model
          – multiprocessor
      • new applications
          – embedded systems
          – mobile/ubiquitous computing
  • 4. Assembler Overview
      • functions
          – translate programs written in assembly language to machine code
              • mnemonic code to machine code
              • symbols to addresses
          – handles
              • constants
              • literals
              • addressing
      • 32 bit constant or address
      • 32 bit offset
  • 5. Assembler Overview (cont’d)
      • pass 1: loop until the end of the program
          1. read in a line of assembly code
          2. assign an address to this line
              • increment N (word addressing or byte addressing)
          3. save address values assigned to labels
              • in symbol tables
          4. process assembler directives
              • constant declaration
              • space reservation
      • pass 2: same loop
          1. read in a line of code
          2. translate op code using the op code table
          3. change labels to addresses using the symbol table
          4. process assembler directives
          5. produce object program
  • 6. Data Structures for Assembler
      example: add $t0, $t1, $t2 => 000000 01001 01010 01000 00000 100000
      • op code table
          – looked up for the translation of mnemonic code
              • key: mnemonic code
              • result: bits
          – hashing is usually used
              • once prepared, the table is not changed
              • efficient lookup is desired
              • since mnemonic code is predefined, the hashing function can be tuned a priori
          – the table may have the instruction format and length
              • to decide where to put op code bits, operand bits, offset bits
              • for variable instruction size
              • used to calculate the address
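The op code table lookup can be sketched in Python. This is a minimal illustration, not the course's implementation: the names `OPTAB`, `REGS`, and `encode_r_type` are made up here, a plain dict stands in for the tuned hash table, and only the R-type format is covered.

```python
# Hypothetical op code table: mnemonic -> (format, opcode bits, funct bits).
OPTAB = {
    "add": ("R", 0b000000, 0b100000),
    "sub": ("R", 0b000000, 0b100010),
}

# MIPS register numbers for the temporaries used in the slide's example.
REGS = {"$t0": 8, "$t1": 9, "$t2": 10, "$t3": 11}

def encode_r_type(mnemonic, rd, rs, rt, shamt=0):
    """Return the six bit fields of an R-type instruction as one string."""
    fmt, opcode, funct = OPTAB[mnemonic]
    if fmt != "R":
        raise ValueError(mnemonic + " is not an R-type instruction")
    # Field order in the machine word: opcode, rs, rt, rd, shamt, funct.
    fields = [(opcode, 6), (REGS[rs], 5), (REGS[rt], 5),
              (REGS[rd], 5), (shamt, 5), (funct, 6)]
    return " ".join(format(value, "0{}b".format(width)) for value, width in fields)

# add $t0, $t1, $t2  (rd = $t0, rs = $t1, rt = $t2)
print(encode_r_type("add", "$t0", "$t1", "$t2"))
# prints 000000 01001 01010 01000 00000 100000
```

Storing the format next to the op code bits is what lets the assembler decide where each operand goes, as the last bullet group on the slide suggests.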
  • 7. Data Structures for Assembler (cont’d)
      • symbol table
          – stored and looked up to assign addresses to labels
              • efficient insertion and retrieval is needed
              • deletion does not occur
          – difficulties in hashing
              • non random keys
          – problem
              • the size varies widely
      example program:
                  .text
                  .globl main
          main:
                  la $t0, array
                  lw $t1, count
                  lw $t2, ($t0)
          loop:
                  lw $t3, 4($t0)
                  ble $t3, $t2, loop2
                  move $t2, $t3
          loop2:
                  add $t1, $t1, -1
                  add $t0, $t0, 4
                  bnez $t1, loop
                  …
                  ….
                  .data
          array:  .word 3, 5, 5, 1, 6, 7, …..
          count:  .word 15
          string1: .asciiz "nmax = "
  • 8. Symbol Table Construction
      source program:
                  .text
                  .globl main
          main:
                  la $t0, array
                  lw $t1, count
                  lw $t2, ($t0)
          loop:
                  lw $t3, 4($t0)
                  ble $t3, $t2, loop2
                  move $t2, $t3
          loop2:
                  add $t1, $t1, -1
                  add $t0, $t0, 4
                  bnez $t1, loop
                  …
                  ….
                  .data
          array:  .word 3, 5, 5, 1, 6, 7, …..
          count:  .word 15
          string1: .asciiz "nmax = "
          bad:    .word 7
      resulting symbol table:
          symbol name    value
          main           0
          loop           12
          loop2          24
          …
          array          408
          count          468
          string1        472
          bad            478
  • 9. Assembler Algorithm: pass 1
      begin
        if starting address is given
          LOCCTR = starting address;
        else
          LOCCTR = 0;
        while OPCODE != END do          ;; or EOF
        begin
          read a line from the code
          if there is a label
            if this label is in SYMTAB, then error
            else insert (label, LOCCTR) into SYMTAB
          search OPTAB for the op code
          if found
            LOCCTR += N                 ;; N is the length of this instruction (4 for MIPS)
          else if this is an assembly directive
            update LOCCTR as directed
          else error
          write line to intermediate file
        end
        program size = LOCCTR - starting address;
      end
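The pass 1 loop can be sketched in Python as follows. This is a simplified sketch under several assumptions that are not in the slides: every instruction (including pseudo-instructions such as `la`) occupies 4 bytes, text and data share one location counter, the intermediate file is omitted, and string operands contain no `:` or `#`. The helper names `directive_size` and `pass1` are illustrative.

```python
OPTAB = {"la", "lw", "ble", "move", "add", "bnez"}   # mnemonics known to this sketch

def directive_size(op, operand):
    """Bytes reserved by the few directives this sketch understands."""
    if op == ".word":
        return 4 * len(operand.split(","))
    if op == ".space":
        return int(operand)
    if op == ".asciiz":
        return len(operand.strip('"')) + 1           # +1 for the terminating NUL
    if op in (".text", ".data", ".globl"):
        return 0
    raise ValueError("unknown op code or directive: " + op)

def pass1(lines, start=0):
    """Build SYMTAB and return (symtab, program size)."""
    symtab, locctr = {}, start
    for line in lines:
        line = line.split("#")[0].strip()            # drop comments and blanks
        if not line:
            continue
        if ":" in line:                              # the line carries a label
            label, line = line.split(":", 1)
            label, line = label.strip(), line.strip()
            if label in symtab:
                raise ValueError("duplicate label: " + label)
            symtab[label] = locctr
            if not line:
                continue
        parts = line.split(None, 1)
        op = parts[0]
        operand = parts[1] if len(parts) > 1 else ""
        if op in OPTAB:
            locctr += 4                              # N = 4 for MIPS
        else:
            locctr += directive_size(op, operand)
    return symtab, locctr - start

program = """
        .text
        .globl main
main:   la   $t0, array
        lw   $t1, count
        lw   $t2, ($t0)
loop:   lw   $t3, 4($t0)
        ble  $t3, $t2, loop2
        move $t2, $t3
loop2:  add  $t1, $t1, -1
        add  $t0, $t0, 4
        bnez $t1, loop
        .data
array:  .word 3, 5, 5
count:  .word 3
""".splitlines()

symtab, size = pass1(program)
print(symtab, size)
```

With this shortened program the labels `main`, `loop`, and `loop2` receive 0, 12, and 24, matching the first three rows of the symbol table on slide 8; the data labels differ because the elided part of the original program is not reproduced.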
  • 10. Assembler Algorithm: pass 2
      begin
        read a line;
        if op code = START then         ;; .globl xxx for MIPS
          write header record;
        while op code != END do         ;; or EOF
        begin
          search OPTAB for the op code;
          if found
            if the operand is a symbol then
              replace it with an address using SYMTAB;
            assemble the object code;
          else if is a defined directive
            convert it to object code;
          add object code to the text;
          read next line;
        end
        write End record to the text;
        output text;
      end
      example: add $t0, $t1, $t2 => 000000 01001 01010 01000 00000 100000
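The core of pass 2, replacing a symbol with its address and assembling the word, might look like the Python sketch below. It handles only two instructions, `add` (R-type) and `j` (J-type, which does not appear in the slide's sample program and is used here only to show a symbol operand). Header and End records and directives are left out, and `assemble`, `pass2`, and the `SYMTAB` contents are assumptions for illustration.

```python
SYMTAB = {"main": 0, "loop": 12, "loop2": 24}     # as a pass 1 might have produced
REGS = {"$t0": 8, "$t1": 9, "$t2": 10}

def assemble(line, symtab):
    """Translate one source line into a 32-bit machine word."""
    op, rest = line.split(None, 1)
    args = [a.strip() for a in rest.split(",")]
    if op == "add":                               # R-type: opcode 0, funct 100000
        rd, rs, rt = (REGS[a] for a in args)
        return (rs << 21) | (rt << 16) | (rd << 11) | 0b100000
    if op == "j":                                 # J-type: opcode 2, 26-bit word address
        if args[0] not in symtab:
            raise KeyError("undefined symbol: " + args[0])
        return (0b000010 << 26) | ((symtab[args[0]] >> 2) & 0x03FFFFFF)
    raise ValueError("unknown op code: " + op)

def pass2(lines, symtab):
    """Return the text segment as a list of machine words."""
    return [assemble(line.strip(), symtab) for line in lines if line.strip()]

text = pass2(["add $t0, $t1, $t2", "j loop"], SYMTAB)
print([format(word, "032b") for word in text])
```

The first word is the same bit pattern as the slide's example, written without the spaces between fields.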
  • 11. Program Relocation
      [Figure: the same program loaded at address 0 (occupying 0 to 1076) and loaded at address 5000 (occupying 5000 to 6076); both images contain the instruction "jump to 1004". With the program at 5000, the code that was at 1004 sits at 6004, so an unmodified "jump to 1004" no longer reaches it.]
      • motivations for relocation
          – a program may consist of several pieces of code that are assembled independently
          – when a program is assembled, it is impossible to know the exact location where the program starts
  • 12. Program Relocation (cont’d)
      • distances from the origin of a program do not change
          – make the address relative to the origin
          – provide the loader with information about
              • which address needs fixing
              • length of address field
          – the loader changes those addresses to
              • distance + start address of a program
          – only absolute addresses need to be changed
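What the loader does with this information can be sketched in a few lines of Python. The representation is an assumption made for brevity: `fields` maps an offset from the program origin to the address value assembled there, and each modification record is just the offset of a field to fix (the field length from the slide is omitted). The offsets 40 and 44 are made up.

```python
def relocate(fields, modification_records, start_address):
    """Return a copy of fields with start_address added to each recorded field."""
    loaded = dict(fields)
    for offset in modification_records:
        # new address = distance from origin + start address of the program
        loaded[offset] = fields[offset] + start_address
    return loaded

fields = {40: 1004, 44: 7}      # offset 40: operand of "jump to 1004"; offset 44: a constant
modification_records = [40]     # only the absolute address needs fixing

print(relocate(fields, modification_records, 5000))
# prints {40: 6004, 44: 7}
```

Loading at 0 leaves every field unchanged, which is why the assembler can produce addresses as if the program started at 0.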
  • 13. Literals
      • usage
          – encoded as an operand (similar to the immediate in MIPS, but different)
              • load $7, =X'0A7F'
          – simple way to declare a constant
          – assembler does
              • declare a constant with a label
              • use the label to use the value
      • comparison with immediate
          – literal is an assembler directive
              • immediate is machine-recognizable data
          – full word can be used for literals
              • immediate: full word minus the bits used for (opcode, registers)
          – values are obtained from data memory (slow)
              • immediate data is within the instruction itself
  • 14. Literals (cont’d)
      • literal pool
          – assembler collects all the literals into one or more literal pools
          – default location is at the end of the program
              • for better code reading
          – programmer can declare a place (LTORG)
              • to use PC-relative addressing
              • to keep data close to instruction
      • optimization
          – make one literal for the same value
              • compare character string or value?
                  – x'454F46' = c'EOF'
              • value comparison needs evaluation
      • literal table
          – name (label), operand value, operand length, address in the table
          – name and value are both used as a key
  • 15. Literal Handling Algorithm
      pass 1
        at a recognition of a literal
          search LITTAB by name
          if found but different value, error
          else if the same value, no action
          else if not found, insert a new literal (no address yet)
        if the code is LTORG or END
          allocate each literal, assigning an address
      pass 2
        replace each literal with the address in the LITTAB
        if these addresses are absolute, prepare modification for relocation
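A small Python sketch of the literal table follows. The class and method names (`LiteralTable`, `see`, `allocate_pool`, `address_of`) are invented for illustration, the table is keyed by name only (the value-based sharing discussed on the previous slide is not implemented), and the starting address 100 is arbitrary.

```python
class LiteralTable:
    def __init__(self):
        self.entries = {}     # name -> {"value", "length", "address"}
        self.pending = []     # literals seen since the last LTORG

    def see(self, name, value, length):
        """Pass 1: called when a literal operand such as =X'0A7F' is recognized."""
        entry = self.entries.get(name)
        if entry is None:
            # not found: insert a new literal, no address yet
            self.entries[name] = {"value": value, "length": length, "address": None}
            self.pending.append(name)
        elif entry["value"] != value:
            raise ValueError("literal %s seen with a different value" % name)
        # found with the same value: no action

    def allocate_pool(self, locctr):
        """Pass 1: at LTORG or END, place pending literals; return the new LOCCTR."""
        for name in self.pending:
            self.entries[name]["address"] = locctr
            locctr += self.entries[name]["length"]
        self.pending = []
        return locctr

    def address_of(self, name):
        """Pass 2: the address that replaces the literal operand."""
        return self.entries[name]["address"]

littab = LiteralTable()
littab.see("=X'0A7F'", 0x0A7F, 2)
littab.see("=C'EOF'", 0x454F46, 3)
littab.see("=X'0A7F'", 0x0A7F, 2)       # same literal again: no action
end = littab.allocate_pool(100)
print(littab.address_of("=X'0A7F'"), littab.address_of("=C'EOF'"), end)
# prints 100 102 105
```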
  • 16. Symbol Defining Statement
      • MAXLEN EQU 4096
          – makes program structure better
          – easier to modify a single location
          – easier to remember than numbers
          – registers can be given meaningful names
          – (maxlen = 4096) in MIPS
      • assembler
          – searches SYMTAB and replaces the symbol with the value in the table
          – resulting object code is the same as using the value instead of the symbol
          – remember that with 2 passes there is a restriction
                X EQU Y
                Y EQU 100
              • X cannot be defined in pass 1
  • 17. Expressions
          BUFFER: .space 4096          ; reserve 4096 bytes here
          BUFEND:                      ; set current location to BUFEND
          (MAXLEN = BUFEND - BUFFER)   ; calculate the size of the buffer
      • allows simple arithmetic operations in symbol definition
      • operands may have relative values for relocation
          – relative values should be modified by the loader later
              • we need to know which is relative
          – symbol table needs a type field to discern absolute symbols from relative symbols
  • 18. Expression Rules
      • basic
          – constant is absolute
          – address is relative
      • using expressions
          – expression with absolute arguments is absolute
          – multiplication and division may be applied only to absolute terms, so an expression that uses them is absolute
          – relative_1 - relative_2 is absolute
              • dependencies on starting address are canceled out
          – all the other expressions having relative terms are neither relative nor absolute (error?)
              • constant - relative
              • relative_1 + relative_2
              • 3 x relative_1
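These rules can be checked mechanically by counting relative terms. The Python sketch below covers only sums and differences, and it assumes the usual convention (not stated on the slide) that one unpaired positive relative term, as in relative + constant, leaves the expression relative. The function name and term encoding are illustrative.

```python
def expression_type(terms):
    """Classify a sum of signed terms as 'absolute' or 'relative'.

    terms is a list of (sign, kind) pairs with sign in {+1, -1} and
    kind in {'absolute', 'relative'}.
    """
    # Each relative term contributes one copy of the starting address,
    # positive or negative; absolute terms contribute none.
    net_relative = sum(sign for sign, kind in terms if kind == "relative")
    if net_relative == 0:
        return "absolute"        # starting-address dependencies cancel out
    if net_relative == 1:
        return "relative"        # assumed convention, see the note above
    raise ValueError("expression is neither absolute nor relative")

# MAXLEN = BUFEND - BUFFER
print(expression_type([(+1, "relative"), (-1, "relative")]))   # prints absolute
```

Calling it with `relative_1 + relative_2` or `constant - relative` raises the error the slide asks about.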
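  • 19. Program Blocks
      [Figure: in the source, pieces of block 0, block 1, and block 2 are interleaved; in the assembled object code, the pieces of each block are gathered so that block 0, block 1, and block 2 are each contiguous.]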
  • 20. Program Blocks (cont’d)
      • motivation
          – programmer’s view may be different from machine’s view
              • affects only efficiency, not functionality
          – addressing can be simplified
              • large data area can be moved to the end of code while source code places it close to the instructions that use this data
      • data structure and algorithm
          – block table (name, block number, address, length)
          – pass 1
              • maintain separate LOCCTR for each block
                  – each label is assigned an address relative to the start of the block that contains it
              • SYMTAB stores block number for each symbol
              • store starting address of each block in block table
          – pass 2
              • assign address to each symbol by adding the relative address to the block starting address
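The per-block location counters can be sketched in Python as below. The input format (a list of `(block, label, size)` triples), the block names, and the decision to lay blocks out in order of first appearance are assumptions of this sketch, not something the slides specify.

```python
def assign_block_addresses(statements):
    """statements: list of (block_name, label_or_None, size_in_bytes)."""
    locctr = {}     # one location counter per block
    symtab = {}     # label -> (block, offset within that block)
    for block, label, size in statements:
        offset = locctr.setdefault(block, 0)
        if label is not None:
            symtab[label] = (block, offset)
        locctr[block] = offset + size
    # End of pass 1: place blocks one after another and fill the block table.
    block_start, address = {}, 0
    for block, length in locctr.items():
        block_start[block] = address
        address += length
    # Pass 2: final address = block starting address + relative address.
    return {label: block_start[b] + off for label, (b, off) in symtab.items()}

statements = [
    ("code", "main", 8),
    ("data", "count", 4),
    ("code", "loop", 12),
    ("data", "array", 400),
]
print(assign_block_addresses(statements))
# prints {'main': 0, 'count': 20, 'loop': 8, 'array': 24}
```

Although `count` is written between two pieces of code in the source, it ends up after all of the code, which is the rearrangement the motivation bullets describe.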
  • 21. Control Sections
      • a control section is a part of a program that can be assembled independently of other parts
          – a large program can be divided into many control sections
          – each control section can be developed independently
          – each control section can be modified independently
      • symbols defined in other control sections
          – called external
          – assembler prepares those symbols
          – loader & linker resolve the value of external symbols
  • 22. Control Sections (cont’d)
      • a table prepared by assembler
          – define record
              • name of symbol defined in this control section
              • relative address of the symbol
          – refer record
              • name of external symbols
          – modification record
              • starting address of field to be modified
              • length of this field
              • name of external symbol
      • loader
          – for every external symbol
              • find the relative address from the define record
              • add the starting address of the control section where the symbol is defined
              • modify the field
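The loader steps above can be sketched in Python. The record layout is simplified and hypothetical: each control section is a dict with a `define` map, a list of `modify` pairs, and the assembled `fields`; refer records and field lengths are omitted, and the section names `MAIN` and `IO` are made up.

```python
def link(sections, load_address=0):
    """Resolve external symbols across control sections.

    Each section is a dict with keys:
      name, length,
      define: {symbol: relative address}            (define record)
      modify: [(field offset, external symbol)]     (modification records)
      fields: {field offset: value assembled there}
    """
    # First, give each control section a start address and collect the
    # absolute address of every symbol it defines.
    start, estab, address = {}, {}, load_address
    for sec in sections:
        start[sec["name"]] = address
        for symbol, relative in sec["define"].items():
            estab[symbol] = address + relative
        address += sec["length"]
    # Then copy the fields into memory and modify the ones that refer
    # to external symbols.
    memory = {}
    for sec in sections:
        base = start[sec["name"]]
        for offset, value in sec["fields"].items():
            memory[base + offset] = value
        for offset, symbol in sec["modify"]:
            memory[base + offset] += estab[symbol]
    return memory

sections = [
    {"name": "MAIN", "length": 100, "define": {},
     "modify": [(8, "BUFFER")], "fields": {8: 0}},
    {"name": "IO", "length": 50, "define": {"BUFFER": 16},
     "modify": [], "fields": {}},
]
print(link(sections, load_address=1000))
# prints {1008: 1116}
```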
  • 23. One-Pass Assembler
      • problem
          – forward reference: reference to symbols that are not defined yet
      • why do we need a one-pass assembler?
          – fast
              • useful for program development and testing
              • university computing environment
      • load-and-go assembler
          – writes the object code to memory, not to a disk file
          – since it is in memory, it is easy to modify a part of the object code
  • 24. One-Pass Assembler (cont’d)
      • one-pass assembler for load-and-go
          – stores undefined symbols in the SYMTAB with the address of the field that references this symbol
          – when the symbol is defined later, look up the SYMTAB and modify the field with the correct address
              • there may be many places to be modified
      • what if object code is written on disk?
          – bring back the text to memory
              • efficiency of one-pass assembler cannot be justified
          – make the loader modify the address at loading time
              • modification record again
      • optimization
          – require all the data declarations be placed at the beginning of the program
              • reduces reference resolution
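The forward-reference handling for a load-and-go assembler can be sketched in Python. The model is deliberately tiny and assumed: every statement occupies one word, addresses are word indices, and a statement is a `(label, operand)` pair. A separate `pending` dict plays the role of the undefined-symbol entries that the slide keeps inside SYMTAB.

```python
def one_pass(statements):
    """statements: list of (label_or_None, operand_symbol_or_None)."""
    memory = []       # object code kept in memory (load-and-go)
    symtab = {}       # symbol -> address, once defined
    pending = {}      # undefined symbol -> addresses of fields that reference it
    for address, (label, operand) in enumerate(statements):
        if label is not None:
            symtab[label] = address
            # The symbol is now defined: fix every earlier reference to it.
            for field in pending.pop(label, []):
                memory[field] = address
        if operand is None:
            memory.append(None)
        elif operand in symtab:
            memory.append(symtab[operand])
        else:
            memory.append(None)                       # placeholder for now
            pending.setdefault(operand, []).append(address)
    if pending:
        raise ValueError("undefined symbols: " + ", ".join(sorted(pending)))
    return memory

statements = [
    (None, "end"),      # forward reference
    ("loop", None),
    (None, "loop"),     # backward reference, resolved immediately
    (None, "end"),      # second forward reference to the same symbol
    ("end", None),
]
print(one_pass(statements))
# prints [4, None, 1, 4, None]
```

Both references to `end` are patched when the label is finally seen, which illustrates the "many places to be modified" bullet.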
  • 25. Multi-Pass Assembler
      • supports forward references even though they are bad for program readability
      example source:
          1. (A = B/2)
          2. (B = C-D)
          ....
          8. C .....
          9. D .....
      • at 1, store two tuples in a table
          – (A, 1, B/2, 0)
              • 1: one symbol is missing
              • 0: no other symbol depends on A
          – (B, *, , &LB)
              • *: don’t know how many symbols are missing yet
              • LB: list of symbols that depend on B (now, there is only A in this list)
      • at 2, insert (C,*, ,&LC), (D,*, ,&LD)
          – LC and LD contain only B
          – modify (B,*, ,&LB) as (B,2,C-D,&LB)
      • after 8
          – from LC, B is found
          – change 2 to 1 in the B tuple, meaning one symbol remains to be defined
      • after 9
          – from LD, B is found
          – now evaluate B with the defined C, D values
          – since B is done, A is found from LB
          – now A can be evaluated
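The bookkeeping in this example can be sketched in Python. The sketch keeps the same two ideas, a count of missing symbols per definition and a list of dependents per symbol, but stores them in separate dicts rather than in the slide's tuples. The expression encoding `(operator, left, right)` and the values given to C and D are assumptions for illustration.

```python
def resolve(definitions):
    """definitions: list of (symbol, expression) in source order.

    An expression is an int or a tuple (operator, left, right) whose
    operands are ints or symbol names. Returns {symbol: value}.
    """
    ops = {"+": lambda a, b: a + b, "-": lambda a, b: a - b,
           "*": lambda a, b: a * b, "/": lambda a, b: a // b}
    value = {}         # symbols whose value is known
    waiting = {}       # symbol -> [number of missing symbols, expression]
    dependents = {}    # symbol -> symbols that depend on it (LB, LC, LD)

    def symbols_in(expr):
        return [] if isinstance(expr, int) else [x for x in expr[1:] if isinstance(x, str)]

    def evaluate(expr):
        if isinstance(expr, int):
            return expr
        op, left, right = expr
        lookup = lambda x: x if isinstance(x, int) else value[x]
        return ops[op](lookup(left), lookup(right))

    def define(symbol, expr):
        value[symbol] = evaluate(expr)
        # Walk the dependents list; evaluate any symbol that is now complete.
        for dep in dependents.pop(symbol, []):
            waiting[dep][0] -= 1
            if waiting[dep][0] == 0:
                define(dep, waiting.pop(dep)[1])

    for symbol, expr in definitions:
        missing = [s for s in symbols_in(expr) if s not in value]
        if not missing:
            define(symbol, expr)
        else:
            waiting[symbol] = [len(missing), expr]
            for s in missing:
                dependents.setdefault(s, []).append(symbol)
    if waiting:
        raise ValueError("unresolved symbols: " + ", ".join(sorted(waiting)))
    return value

definitions = [
    ("A", ("/", "B", 2)),      # line 1: A = B/2
    ("B", ("-", "C", "D")),    # line 2: B = C-D
    ("C", 300),                # line 8 (value assumed)
    ("D", 100),                # line 9 (value assumed)
]
print(resolve(definitions))
# prints {'C': 300, 'D': 100, 'B': 200, 'A': 100}
```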