合肥生活安徽新聞合肥交通合肥房產(chǎn)生活服務(wù)合肥教育合肥招聘合肥旅游文化藝術(shù)合肥美食合肥地圖合肥社保合肥醫(yī)院企業(yè)服務(wù)合肥法律

        代做Micro Language Compiler

        時間:2024-02-07  來源:合肥網(wǎng)hfw.cc  作者:hfw.cc 我要糾錯



        Assignment 1: Micro Language Compiler
        1 Introduction
        In this assignment, you are required to design and implement a compiler frontend for Micro
        language which transforms the Micro Program into corresponding LLVM Intermediate Representation (IR) and finally translated to RISC-V assembly code and executable with the help of
        LLVM optimizer and its RISC-V backend. After that, we can execute the compiled program on our
        RISC-V docker container to verify the correctness of your compiler.
        Since it is a senior major elective course, we don’t want to set any limitation for you. You are strongly
        recommended to use Lex/Flex and Yacc/Bison taught in tutorial 3 to design your compiler frontend,
        but it is not forcible. You can choose Any Programming Language you like for this assignment,
        but the RISC-V container we use only has C/C++ toolchain installed and you need to provide me a
        Dockerfile to run your compiler and execute the RISC-V program, which may need some extra effort.
        Some languages also provide tools like Lex and Yacc, and you are free to use them. It is also OK if
        you want to design the scanner and parser by hand instead of using tools.
        2 Micro Language
        Before we move on to the compiler design, it is necessary to have an introduction to Micro Language,
        that serves as the input of our compiler. Actually, it is a very simple language, with limited number
        of tokens and production rules of context-free grammar (CFG):
        • Only integers(i**); No float numbers
        • No Declarations
        • Variable consists of a-z, A-Z, 0-9, at most ** characters long, must start with character and are
        initialized as 0
        • Comments begin with ”−−” and end with end-of-line(EOL)
        • Three kind of statements:
        – assignments, e.g. a:=b+c
        – read(list of IDs), e.g. read(a, b)
        – write(list of Expressions), e.g. write (a, b, a+b)
        • BEGIN, END, READ, WRITE are reserved words
        • Tokens may not extend to the following line
        1
        2.1 Tokens & Regular Expression
        Micro Language has 14 Tokens, and the regular expression for each token is listed below. Since BEGIN
        is a reserved word in C/C++, we need to use the alternative BEGIN as the token class.
        1. BEGIN : begin
        2. END: end
        3. READ: read
        4. WRITE: write
        5. LPAREN: (
        6. RPAREN: )
        7. SEMICOLON: ;
        8. COMMA: ,
        9. ASSIGNOP: :=
        10. PLUSOP: +
        11. MINUSOP: −
        12. ID: [a−zA−Z][a−zA−Z0−9 ]{0,31}
        13. INTLITERAL: −?[0−9]+
        14. SCANEOF: <<EOF>>
        2.2 Context Free Grammar
        Here is the extended context-free grammar (CFG) of Micro Language:
        1. <start> → <program> SCANEOF
        2. <program> → BEGIN <statement list> END
        3. <statement list> → <statement> {<statement>}
        4. <statement> → ID ASSIGNOP <expression>;
        5. <statement> → READ LPAREN <id list> RPAREN;
        6. <statement> → WRITE LPAREN<expr list> RPAREN;
        7. <id list > → ID {COMMA ID}
        8. <expr list > → <expression> {COMMA <expression>}
        9. <expression> → <primary> {<add op> <primary>}
        10. <primary> → LPAREN <expression> RPAREN
        11. <primary> → ID
        12. <primary> → INTLITERAL
        13. <add op> → PLUSOP
        14. <add op> → MINUSOP
        Note: {} means the content inside can appear 0, 1 or multiple times.
        2.3 How to Run Micro Compiler
        Here is a very simple Micro program that we are going to use as the sample program throughout this
        instruction. SCANEOF is the end of line and we do not need to explicitly write it in the program.
        −− Expected Output: 30
        begin
        A := 10;
        B := A + 20;
        write (B);
        end
        We can use our compiler to compile, optimize and execute this program to get expected output 30.
        Note: The exact command to run your compiler is up to you. Just specify out in your report how
        to compile and execute your compiler to get the expected output.
        118010200@c2d52c9b1339:˜/A1$ ./compiler ./testcases/test0.m
        118010200@c2d52c9b1339:˜/A1$ llc −march=riscv64 ./program.ll −o ./program.s
        118010200@c2d52c9b1339:˜/A1$ riscv64−unknown−linux−gnu−gcc ./program.s −o ./program
        118010200@c2d52c9b1339:˜/A1$ qemu−riscv64 −L /opt/riscv/sysroot ./program
        30
        2
        3 Compiler Design
        Figure 1 shows the overall structure of a compiler. In this assignment, your task is to implement the
        frontend only, which contains scanner, parser and intermediate code generator and form a whole →
        compiler with LLVM for Micro language.
        Figure 1: Compiler Structure
        3.1 Scanner
        Scanner takes input character stream and extracts out a series of tokens, and your scanner should
        be able to print out both token class and lexeme for each token. Figure ?? shows an example of the
        sample Micro program.
        118010200@c2d52c9b1339:˜/A1$ ./compiler ./testcases/test0.m −−scan−only
        BEGIN begin
        ID A
        ASSIGNOP :=
        INTLITERAL 10
        SEMICOLON ;
        ID B
        ASSIGNOP :=
        ID A
        PLUOP +
        INTLITERAL 20
        SEMICOLON ;
        WRITE write
        LPAREN (
        ID B
        RPAREN )
        SEMICOLON ;
        END end
        SCANEOF
        3
        3.2 Parser
        Parser receives the tokens extracted from scanner, and generates a parse tree (or concrete syntax tree),
        and futhermore, the abstract syntax tree (AST) based on the CFG. Your compiler should be able to
        print out both the parse tree and the abstract syntax tree, and visualize them with graphviz.
        3.2.1 Parse Tree (Concrete Syntax Tree)
        Figure 2 shows an example of the concrete syntax tree generated from the sample program:
        Figure 2: Concrete Syntax Tree of Sample Program
        3.2.2 Abstract Syntax Tree
        Figure 3 shows an example of the abstract syntax tree generated from the sample program:
        Figure 3: Abstract Syntax Tree of Sample Program
        4
        3.3 Intermediate Code Generator
        For all the assignments in this course, the intermediate representation (IR) of the compiler should be
        the LLVM IR. There are mainly two reasons why LLVM IR is chosen. One is that LLVM IR can take
        advantage of the powerful LLVM optimizer and make up for the missing backend part of compiler in
        this course. The other is that LLVM IR can be easier translated into assembly code for any target
        machine, no matter MIPS, x86 64, ARM, or RISC-V. This functionality can make our compiler more
        compatible to machines with different architecture. The following shows the LLVM IR generated for
        the sample Micro program:
        ; Declare printf
        declare i** @printf (i8 ∗, ...)
        ; Declare scanf
        declare i** @scanf(i8 ∗, ...)
        define i** @main() {
        % ptr0 = alloca i**
        store i** 10, i**∗ % ptr0
        %A = load i**, i**∗ % ptr0
        % 1 = add i** %A, 20
        store i** % 1, i**∗ % ptr0
        %B = load i**, i**∗ % ptr0
        % scanf format0 = alloca [4 x i8 ]
        store [4 x i8 ] c”%d\0A\00”, [4 x i8]∗ % scanf format0
        % scanf str0 = getelementptr [4 x i8 ], [4 x i8]∗ % scanf format0, i** 0, i** 0
        call i** (i8 ∗, ...) @printf (i8∗ % scanf str0 , i** %B)
        ret i** 0
        }
        3.4 Bonus (Extra Credits 10%)
        If you are interested and want to make your compiler better, you may try the following options:
        • Add robust syntax error report
        • Add a symbol table to make your compiler more complete
        • Generate LLVM IR with LLVM C/C++ API instead of simply generating the string
        • Optimize the LLVM IR generation plan for more efficient IR
        • Any other you can think about...
        4 Submission and Grading
        4.1 Grading Scheme
        • Scanner: 20%
        • Parser: 40% (20% for parse tree generator and 20% for AST generation)
        • Intermediate Code Generator: 30%
        We have prepared 10 test cases, and the points for each section will be graded according to the
        number of testcases you passed.
        • Technical Report: 10%
        If your report properly covers the three required aspects and the format is clean, you will get 10
        points.
        5
        • Bonus: 10%
        Refer to section 3.4 for more details. The grading of this part will be very flexible and highly
        depend on the TA’s own judgement. Please specify clearly what you have done for the bonus
        part so that he do not miss anything.
        4.2 Submission with Source Code
        If you want to submit source C/C++ program that is executable in our RISC-V docker container,
        your submission should look like:
        csc4180−a1−118010200.zip
        |−
        |−−− csc4180−a1−118010200−report.pdf
        |−
        |−−− testcases
        |−
        |−−− src
        |−
        |−−−Makefile
        |−−−ir generator.cpp
        |−−−ir generator.hpp
        |−−−node.cpp
        |−−−node.hpp
        |−−−scanner.l
        |−−−parser.y
        |−−−Other possible files
        4.3 Submission with Docker
        If you want to submit your program in a docker, your submission should look like:
        csc4180−a1−118010200.zip
        |−
        |−−− csc4180−a1−118010200.Dockerfile
        |−
        |−−− csc4180−a1−118010200−report.pdf
        |−
        |−−− src
        |−
        |−−−Makefile
        |−
        |−−−run compiler.sh
        |−
        |−−−Your Code Files
        4.4 Technical Report
        Please answer the following questions in your report:
        • How to execute your compiler to get expected output?
        • How do you design the Scanner?
        • How do you design the Parser?
        • How do you design the Intermediate Code Generator?
        • Some other things you have done in this assignment?
        The report doesn’t need to be very long, but the format should be clean. As long as the three questions
        are clearly answered and the format is OK, your report will get full mark.
        如有需要,請加QQ:99515681 或WX:codehelp

        掃一掃在手機打開當前頁
      1. 上一篇:COM3524代做、代寫Java,Python編程設(shè)計
      2. 下一篇:CISC3025代做、代寫Java,c++設(shè)計編程
      3. 無相關(guān)信息
        合肥生活資訊

        合肥圖文信息
        出評 開團工具
        出評 開團工具
        挖掘機濾芯提升發(fā)動機性能
        挖掘機濾芯提升發(fā)動機性能
        戴納斯帝壁掛爐全國售后服務(wù)電話24小時官網(wǎng)400(全國服務(wù)熱線)
        戴納斯帝壁掛爐全國售后服務(wù)電話24小時官網(wǎng)
        菲斯曼壁掛爐全國統(tǒng)一400售后維修服務(wù)電話24小時服務(wù)熱線
        菲斯曼壁掛爐全國統(tǒng)一400售后維修服務(wù)電話2
        美的熱水器售后服務(wù)技術(shù)咨詢電話全國24小時客服熱線
        美的熱水器售后服務(wù)技術(shù)咨詢電話全國24小時
        海信羅馬假日洗衣機亮相AWE  復(fù)古美學(xué)與現(xiàn)代科技完美結(jié)合
        海信羅馬假日洗衣機亮相AWE 復(fù)古美學(xué)與現(xiàn)代
        合肥機場巴士4號線
        合肥機場巴士4號線
        合肥機場巴士3號線
        合肥機場巴士3號線
      4. 上海廠房出租 短信驗證碼 酒店vi設(shè)計

        主站蜘蛛池模板: 中文字幕日本精品一区二区三区| 91福利国产在线观看一区二区 | 日本精品一区二区三区四区| 日本国产一区二区三区在线观看| 国模无码视频一区二区三区| 一区二区高清在线观看| 综合人妻久久一区二区精品| 亚洲av无码片区一区二区三区| 亚洲sm另类一区二区三区| 精品国产一区二区三区www| 国产在线精品一区二区高清不卡 | 亚洲视频在线观看一区| 国产短视频精品一区二区三区| 日韩一区二区三区免费体验| 国产色精品vr一区区三区| 中文字幕亚洲综合精品一区| 精品国产免费观看一区| 中文字幕精品无码一区二区 | 学生妹亚洲一区二区| 日本美女一区二区三区| 国产一区二区电影| 国语对白一区二区三区| 国产一区三区二区中文在线| 久久久91精品国产一区二区三区| 亚洲国产成人久久综合一区| 国产成人精品亚洲一区| 亚洲成AV人片一区二区密柚 | 亚洲一区二区三区偷拍女厕| 亚洲日韩中文字幕一区| 日韩精品无码一区二区三区四区| 国产精品合集一区二区三区| 精品视频在线观看你懂的一区| 亚洲影视一区二区| 午夜视频久久久久一区 | 肥臀熟女一区二区三区| 成人区精品一区二区不卡亚洲| 麻豆一区二区三区蜜桃免费| 亚洲毛片不卡av在线播放一区| 黑人大战亚洲人精品一区| 国产精品538一区二区在线| 国产免费私拍一区二区三区|