A single clock cycle MIPS RISC processor design using VHDL

General Purpose Six-Stage Pipelined Processor

Article

Full-text available

Apr 2013

This paper proposes the design of a six-stage pipelined processor. The architecture is modified to increase the speed of operation. The processor comprises an ALU, a pipelined datapath, a data forwarding unit, control logic, data and program memories, and a hazard control unit. Hazard detection and data forwarding units are included for efficient implementation of the pipeline. The processor was designed and verified in Verilog on the Xilinx 14.1 platform, and an assembler written in Perl reduces the complexity of writing instructions into program memory. The design uses 1168 LUTs, achieves a frequency of 277.9 MHz, and delivers 20% better performance on single-threaded programs than a conventional pipelined processor.
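The abstract does not describe how its forwarding and hazard units decide what to do. As a rough illustration only, the sketch below follows the textbook rules for a classic five-stage MIPS pipeline, not the six-stage design of the paper. The function names, argument names and return labels are all assumptions made for this example.

```python
def forward_select(src_reg, ex_mem_regwrite, ex_mem_rd, mem_wb_regwrite, mem_wb_rd):
    """Pick the source of an ALU operand (textbook five-stage rule, assumed here).

    The most recent result (EX/MEM) wins over the older one (MEM/WB).
    Register 0 is hard-wired to zero in MIPS, so it is never forwarded.
    """
    if ex_mem_regwrite and ex_mem_rd != 0 and ex_mem_rd == src_reg:
        return "EX_MEM"
    if mem_wb_regwrite and mem_wb_rd != 0 and mem_wb_rd == src_reg:
        return "MEM_WB"
    return "REG_FILE"


def load_use_stall(id_ex_memread, id_ex_rt, if_id_rs, if_id_rt):
    """Return True when the pipeline must stall one cycle (load-use hazard).

    A load in EX cannot forward its data to the instruction right behind it,
    because the data only becomes available after the memory stage.
    """
    return bool(id_ex_memread and id_ex_rt != 0 and id_ex_rt in (if_id_rs, if_id_rt))


# add $3,$1,$2 followed by sub $5,$3,$4: operand $3 comes from EX/MEM.
print(forward_select(3, True, 3, False, 0))
# lw $2,0($1) followed by add $4,$2,$5: one stall cycle is required.
print(load_use_stall(True, 2, 2, 5))
```

Forwarding removes most data hazards without stalling; the load-use case is the one that still needs a bubble in this textbook model.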

A Fastest RISC Processor using Convolution Method

Article

Jul 2017

Performance Improvement in MIPS Pipeline Processor based on FPGA

Conference Paper

Full-text available

Jan 2016

Kirat Pal Singh

The paper describes the design and synthesis of a basic 5-stage pipelined MIPS-32 processor, with the aim of finding the longest path delay across different process technologies. Identifying the large propagation delay, or critical path, within the circuit and improving the hardware that causes it is a standard method for increasing performance. The pipeline stages are organized so that the pipeline can be clocked at a high frequency. The design has been synthesized at different process technologies, targeting Spartan3, Spartan6, Virtex4, Virtex5 and Virtex6 devices. The synthesis report indicates that the critical path lies in the execution unit. The maximum critical path delay is 41.405 ns at 90 nm and the minimum is 6.57 ns at 40 nm. The performance comparison shows that the pipelined processor can run at 178 MHz at 40 nm, i.e. 49.7% better than the other technologies.

Design of High performance MIPS-32 Pipeline Processor

Conference Paper

Full-text available

Apr 2012

The paper describes the design and synthesis of a basic 5-stage pipelined MIPS-32 processor, with the aim of finding the longest path delay across different process technologies. Identifying the large propagation delay, or critical path, within the circuit and improving the hardware that causes it is a standard method for increasing performance. The pipeline stages are organized so that the pipeline can be clocked at a high frequency. The design has been synthesized at different process technologies, targeting Spartan3, Spartan6, Virtex4, Virtex5 and Virtex6 devices. The synthesis report indicates that the critical path lies in the execution unit. The maximum critical path delay is 41.405 ns at 90 nm and the minimum is 6.57 ns at 40 nm. The performance comparison shows that the pipelined processor can run at 178 MHz at 40 nm, i.e. 49.7% better than the other technologies.

Design and Development of Microarchitecture for Dynamic IoT Communication

Article

Nov 2021

Microarchitecture is now used in many digital applications to improve device performance. It is particularly applicable in the Internet of Things (IoT), where a purpose-designed processor can maximize communication performance. Microarchitecture in IoT usually operates in a dynamic environment and must handle multiple diverse tasks simultaneously. However, a damaged or weak processor can disrupt operation by consuming more energy, so awareness of the merits and limitations of a microarchitecture is needed to select a good processor. This article therefore presents a detailed review of energy-efficient microarchitectures in IoT devices and of their role in accelerating communication. Several works from the literature are discussed, with their advances and limitations presented in both tabular and graphical form. The discussion section elaborates on the common shortcomings of the reviewed literature and their causes. Finally, directions for future work on improving microarchitecture efficiency are given.

DESIGN of a 32-BIT SINGLE-CYCLE MIPS RISC PROCESSOR

Article

Full-text available

Aug 2021

Mahmut Efil

In this paper, a 32-bit single-cycle MIPS RISC processor is designed and simulated using VHDL. The RISC architecture has a hardware infrastructure that eases the implementation of the instruction set, including load and store instructions. The MIPS RISC processor has three instruction formats, R-type, I-type and J-type, and each instruction performs a specific operation that is decoded from its bit fields in the datapath. Several blocks realize and control the processor architecture, and the MIPS datapath is divided into stages according to the operation performed. Once the datapath and the necessary connections inside the processor are completed, the ALU computes the operation requested by each instruction. The Artix-7 (xc7a35tcpg236-1) kit inside the Xilinx ISE Design Suite is used for simulation purposes. Finally, results for the instruction set are obtained and interpreted.
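The R-type, I-type and J-type formats mentioned above have fixed bit fields in the standard MIPS32 encoding. The sketch below splits a 32-bit instruction word into those fields. The function name `decode` and the simplified format test (opcode 0 for R-type, opcodes 2 and 3 for J-type, everything else treated as I-type) are assumptions for illustration and do not come from the paper.

```python
def decode(word: int) -> dict:
    """Split a 32-bit MIPS instruction word into its standard fields."""
    opcode = (word >> 26) & 0x3F            # bits 31..26
    if opcode == 0:                         # R-type: register-register operations
        return {
            "format": "R",
            "opcode": opcode,
            "rs": (word >> 21) & 0x1F,      # first source register
            "rt": (word >> 16) & 0x1F,      # second source register
            "rd": (word >> 11) & 0x1F,      # destination register
            "shamt": (word >> 6) & 0x1F,    # shift amount
            "funct": word & 0x3F,           # selects the ALU operation
        }
    if opcode in (0x02, 0x03):              # J-type: j and jal
        return {"format": "J", "opcode": opcode, "target": word & 0x03FFFFFF}
    return {                                # simplification: everything else as I-type
        "format": "I",
        "opcode": opcode,
        "rs": (word >> 21) & 0x1F,
        "rt": (word >> 16) & 0x1F,
        "imm": word & 0xFFFF,               # 16-bit immediate, not sign-extended here
    }


print(decode(0x012A4020))  # add $t0, $t1, $t2
print(decode(0x8FA80004))  # lw $t0, 4($sp)
```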

Design and implementation of dual-core MIPS processor for LU decomposition based on FPGA

Article

Full-text available

Apr 2021
IJECE

Many systems, such as control and communication systems, require matrix inversion. Inversion involves many operations, which makes real-time constraints hard or impossible to meet. One alternative is the LU decomposition of the matrix, which is a good substitute for matrix inversion. It yields two matrices: L, a lower triangular matrix, and U, an upper triangular matrix. In this paper, a dual-core processor is designed as the hardware platform, and software is written so that the two cores work simultaneously to compute the L and U matrices. The results are compared with other works that use a single-core processor and show that the dual-core design requires less time than a single core. The dual-core processor is described in VHDL.
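For reference, a minimal single-threaded sketch of LU decomposition follows. It uses the Doolittle scheme (unit diagonal in L) without pivoting, which is an assumption made here; the abstract does not say which variant the paper uses or how the work is split between the two cores.

```python
def lu_decompose(a):
    """Doolittle LU decomposition without pivoting: returns (L, U) with A = L * U."""
    n = len(a)
    lower = [[0.0] * n for _ in range(n)]
    upper = [[0.0] * n for _ in range(n)]
    for i in range(n):
        lower[i][i] = 1.0  # Doolittle convention: unit diagonal in L
        # Row i of U.
        for j in range(i, n):
            upper[i][j] = a[i][j] - sum(lower[i][k] * upper[k][j] for k in range(i))
        if upper[i][i] == 0:
            raise ZeroDivisionError("zero pivot; this sketch does not pivot")
        # Column i of L, below the diagonal.
        for j in range(i + 1, n):
            lower[j][i] = (a[j][i] - sum(lower[j][k] * upper[k][i] for k in range(i))) / upper[i][i]
    return lower, upper


L, U = lu_decompose([[4.0, 3.0], [6.0, 3.0]])
print(L)  # lower triangular factor
print(U)  # upper triangular factor
```

Within one step `i`, the entries of row `i` of U are independent of each other, and so are the entries of column `i` of L, which is one place where work could be shared between cores.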

32-Bit RISC processor with floating point unit for DSP applications

Conference Paper

Full-text available

May 2016

With the advent of technology, digital signal processing applications are flourishing in space, medical and many commercial areas. The RISC processor is at the heart of many high-speed embedded and digital signal processing applications. Floating-point representation is preferred over fixed-point because it offers a wider dynamic range of values. Hence, this paper proposes a high-speed MIPS-based 32-bit RISC processor with a single-precision floating-point unit for DSP applications. The design focuses on improving the performance of the floating-point arithmetic unit so that the performance of the whole RISC processor improves. The proposed processor can execute arithmetic, logical, floating-point, data transfer, memory, shift and rotate instructions. Complex multiplication is used frequently in DSP applications, so a special instruction for complex multiplication is included. Because multiplication consumes most of the time, power and area of any operation, the number of multipliers is reduced from four to two compared with the conventional complex multiplication method. The design is coded in Verilog HDL, simulated on Xilinx ISE 13.1 and synthesized on Spartan 6. Results indicate that the proposed design is optimized for both speed and area.

Multicore RISC Processor Implementation by VHDL for Educational Purposes

Conference Paper

Full-text available

May 2015

As computer manufacturers increasingly build computers with multicore processors, it becomes necessary to study the hardware architecture of such processors and the way data is managed between cores. All previous research designed single-cycle or pipelined processors on FPGA (Field Programmable Gate Array). This is a first research work on parallel processing that designs and implements a multicore processor on FPGA. In this work, the multicore processor has two cores, each consisting of a 5-stage pipelined MIPS (Microprocessor without Interlocked Pipeline Stages) RISC (Reduced Instruction Set Computer) processor. A separate data cache and instruction cache are added to each core. The MESI (Modified, Exclusive, Shared and Invalid) protocol is used to manage cache and memory coherence; it supports a write-back policy, and no replacement algorithm is needed. Many programs were tested on this design and the correct results were obtained. The complete multicore processor is described in VHDL (Very high speed integrated circuit Hardware Description Language), implemented using the Xilinx ISE Design Suite 13.4 software and configured on an FPGA Spartan-3AN starter kit, from which results were obtained.
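The MESI protocol named above can be summarized as a small state machine per cache line. The sketch below follows the commonly described textbook transitions. The event names (`local_read`, `local_write`, `bus_read`, `bus_write`) and function names are assumptions for this example, and the paper's actual controller may differ.

```python
from enum import Enum


class State(Enum):
    M = "Modified"
    E = "Exclusive"
    S = "Shared"
    I = "Invalid"


def next_state(state, event, others_have_copy=False):
    """Next MESI state of one cache line for a given event.

    local_* events come from this core; bus_* events are snooped requests
    issued by the other core.
    """
    if event == "local_write":
        return State.M                      # writer ends up with the only, dirty copy
    if event == "local_read":
        if state is State.I:                # read miss: Shared if another cache holds it
            return State.S if others_have_copy else State.E
        return state                        # read hit keeps the current state
    if event == "bus_read":
        return State.I if state is State.I else State.S
    if event == "bus_write":
        return State.I                      # another core wants ownership: invalidate
    raise ValueError(f"unknown event: {event}")


def needs_writeback(state, event):
    """A Modified line must be written back when another core requests it."""
    return state is State.M and event in ("bus_read", "bus_write")


print(next_state(State.I, "local_read"))   # read miss with no other copy
print(next_state(State.M, "bus_read"))     # dirty line snooped by the other core
```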

Dynamically scalable dual-core pipelined processor

Article

Feb 2015

This article proposes the design and architecture of a dynamically scalable dual-core pipelined processor. The design methodology is core fusion: two independent cores can dynamically morph into a larger processing unit, or they can be used as distinct processing elements, to achieve both high sequential performance and high parallel performance. The processor provides two execution modes. Mode1 is a multiprogramming mode for executing instruction streams of lower data width, i.e., each core performs 16-bit operations individually. Performance improves in this mode, at the cost of area, because instructions execute in parallel in both cores. In mode2, the two cores are coupled and behave like a single processing unit of higher data width, i.e., one that can perform 32-bit operations. Additional core-to-core communication is needed to realise this mode. The mode can be switched dynamically, so the processor can provide multiple functions with a single design. The processor was designed and verified using Verilog on the Xilinx 14.1 platform, in both simulation and synthesis, with the help of test programs. The design is intended for implementation on a Xilinx Spartan 3E XC3S500E FPGA.
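The abstract does not detail the core-to-core communication used in mode2. One plausible illustration, offered purely as an assumption, is a 32-bit addition built from two 16-bit additions, where the carry out of the low half is the value passed between the cores. The names `add16` and `add32_fused` are hypothetical.

```python
def add16(a, b, carry_in=0):
    """One 16-bit adder: returns (16-bit sum, carry out)."""
    total = (a & 0xFFFF) + (b & 0xFFFF) + carry_in
    return total & 0xFFFF, total >> 16


def add32_fused(a, b):
    """32-bit add from two 16-bit adders (assumed model of the coupled mode).

    The first adder handles the low halves; its carry is handed to the second
    adder, which handles the high halves.
    """
    low, carry = add16(a & 0xFFFF, b & 0xFFFF)
    high, carry_out = add16((a >> 16) & 0xFFFF, (b >> 16) & 0xFFFF, carry)
    return (high << 16) | low, carry_out


print(add32_fused(0x0001FFFF, 0x00000001))  # carry ripples from low half to high half
```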

Simulation and Verification of Self Test 16Bit Processor

Article

Apr 2011

Manoranjan Pradhan

This paper presents the design and verification of a 16-bit processor. A Booth multiplier and a restoring divider are integrated into the ALU of the proposed processor. The processor is described at the structural level to verify the general understanding of the system. It uses 16-bit instructions in three formats: R-format, I-format and J-format. The control unit generates all the control signals needed to coordinate the components of the processor. All modules in the design are coded in VHDL (very high speed integrated circuit hardware description language) to ease description, verification, simulation and hardware implementation. Design entry, synthesis and simulation of the processor are done using Xilinx ISE 10.1 software, and the design is implemented on an XC2S200-6pq208 Spartan-II FPGA device.
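The two arithmetic techniques named above can be sketched in software as follows. This is a bit-level model of radix-2 Booth multiplication and of restoring division for 16-bit operands, written for illustration; it is not the paper's VHDL, and the function names are assumptions. The Booth model does not handle the most negative multiplicand (-32768 for 16 bits), whose negation does not fit in 16 bits.

```python
def booth_multiply(m, r, bits=16):
    """Signed multiply with radix-2 Booth recoding; returns the signed product.

    Limitation of this sketch: m must not be the most negative value.
    """
    mask = (1 << bits) - 1
    width = 2 * bits + 1                       # product register plus one extra bit
    full = (1 << width) - 1
    add = (m & mask) << (bits + 1)             # +m aligned to the upper half
    sub = ((-m) & mask) << (bits + 1)          # -m aligned to the upper half
    p = (r & mask) << 1                        # multiplier with an appended 0 bit
    for _ in range(bits):
        pair = p & 0b11                        # current bit and previous bit
        if pair == 0b01:
            p = (p + add) & full               # end of a run of ones: add m
        elif pair == 0b10:
            p = (p + sub) & full               # start of a run of ones: subtract m
        sign = p >> (width - 1)
        p = (p >> 1) | (sign << (width - 1))   # arithmetic shift right by one
    p >>= 1                                    # drop the appended bit
    if p & (1 << (2 * bits - 1)):              # reinterpret as a signed value
        p -= 1 << (2 * bits)
    return p


def restoring_divide(dividend, divisor, bits=16):
    """Unsigned restoring division; returns (quotient, remainder)."""
    if divisor == 0:
        raise ZeroDivisionError("division by zero")
    quotient, remainder = 0, 0
    for i in range(bits - 1, -1, -1):
        remainder = (remainder << 1) | ((dividend >> i) & 1)  # bring down next bit
        remainder -= divisor                                  # trial subtraction
        if remainder < 0:
            remainder += divisor                              # restore on failure
            quotient = quotient << 1
        else:
            quotient = (quotient << 1) | 1
    return quotient, remainder


print(booth_multiply(3, -4))
print(restoring_divide(100, 7))
```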

Design and Implementation of 6-Stage 64-bit MIPS Pipelined Architecture

Article

Full-text available

Oct 2019

Pipelining is the concept of overlapping the execution of multiple instructions to make better use of time and hardware units. This paper presents the design and implementation of a 6-stage pipelined architecture for a high-performance 64-bit Microprocessor without Interlocked Pipeline Stages (MIPS) based Reduced Instruction Set Computing (RISC) processor. In this work, a pre-fetching unit, a forwarding unit, a branch and jump prediction unit, and a hazard unit are combined to reduce hazards. A low-power unit is used to minimize power. Cache memories, other devices and, especially, balanced pipeline stages optimize speed. A DDR4 SDRAM (Double Data Rate type 4 Synchronous Dynamic Random Access Memory) controller is employed in this pipeline to achieve high-speed data transfers and to manage the entire system efficiently. Low-power, low-delay flip-flops are used in the pipeline registers, which implicitly enhances system performance. The proposed method provides better results than the existing models. The simulation and synthesis results of the proposed architecture are evaluated with Xilinx 14.7 software, and supporting graphs are plotted in MATLAB.

FIVE STAGE PIPELINED MIPS PROCESSOR VERIFICATION SEQUENCE MODULE USING UVM

Conference Paper

Apr 2023

Five Stage Pipelined MIPS Processor Verification Coverage Module Using UVM

Conference Paper

Mar 2023

FPGA Implementation of a Novel Dual - BRAM Processor Architecture

Conference Paper

Jul 2020

Design and optimization of a divider in pipelined processors based on quotient-try

Conference Paper

Jun 2017

Desenvolvimento de um Processador Baseado na Arquitetura MIPS32 Utilizando Hardware Reconfigurável

Article

Full-text available

Sep 2017

This paper presents the process of developing a 32-bit MIPS processor with reduced functionality, using concepts of reconfigurable computing through the synthesis on Field Programmable Gate Array (FPGA) using VHDL language to describe the hardware. The result of this project could be used as teaching material in related disciplines, besides offering a validated IP of MIPS 32 bits processor, which can be further used as part of various research projects in microelectronics, computer architecture and hardware description languages.

Implementação de um Processador Baseado no MIPS32 Utilizando Hardware Reconfigurável

Thesis

Full-text available

Feb 2016

This work presents the development of a 32-bit MIPS processor with reduced functionality, using concepts of reconfigurable computing, through synthesis on an FPGA with the hardware described in VHDL. In addition to the processor, it covers techniques for building a user interaction platform called MIPSDUINO32. Developed using Arduino, this platform aims to give its users a transparent interface for communicating with the processor, without requiring them to understand techniques and tools for simulating VHDL code or low-level communication processes. The result of this project may be used as teaching material in related courses at the IFMG Formiga campus, and it also provides the campus with a validated IP of the 32-bit MIPS processor, which may later be used in various research projects in microelectronics, computer architecture and hardware description languages.

Vhdl Implementation of A Mips-32 Pipeline Processor

Article

Full-text available

Jan 2012

This paper presents the design and implementation of a basic five-stage pipelined MIPS-32 CPU. Particular attention is paid to reducing clock cycles for lower instruction latency, as well as to taking advantage of high-speed components in an attempt to reach a clock speed of at least 100 MHz. The final results allowed the CPU to run at over 200 MHz with a very reasonable chip area of around 900,000 nm².

An efficient high speed RISC processor for convolution

Conference Paper

Jan 2015

Many algorithms have been designed to improve the performance of filters by using convolution. The proposed RISC CPU is a single-cycle, non-pipelined processor with a uniform 32-bit instruction format. It has a load/store architecture, in which operations are performed only on registers and not on memory locations. It follows the classical von Neumann architecture, with a single common memory bus for both instructions and data. A total of 27 instructions are designed in the initial development step of the processor. The instruction set consists of logical, immediate, jump, load, store and HALT instructions. The combined advantages of the RISC processor, such as high speed, low power, area efficiency and operation-specific design possibilities, have been analyzed.

Design and simulation of 32-Bit RISC architecture based on MIPS using VHDL

Conference Paper

Jan 2015

VHDL (Very High Speed Integrated Circuits Hardware Description Language) is widely used for ASIC (Application Specific Integrated Circuit) emulation, as well as a solution for applications with high volatility. FPGAs (Field Programmable Gate Arrays) offer quick time to market, and their re-programmability often makes them the main part of the system. This paper presents the design of a RISC (Reduced Instruction Set Computer) CPU architecture based on MIPS (Microprocessor without Interlocked Pipeline Stages) using VHDL. It also describes the instruction set, architecture and timing diagram of the processor. Converting floating-point numbers to fixed-point numbers is the main task when working with these numbers; the conversion is performed by a float-to-fixed converter module. Finally, the design, synthesis and simulation of the proposed MIPS-based RISC processor were carried out using the Xilinx ISE 13.1i simulator, with the code written in VHDL.

Vhdl Implementation of A Mips-32 Pipeline Processor

Conference Paper

Full-text available

Nov 2012

This paper presents the design and implementation of a basic five-stage pipelined MIPS-32 CPU. Particular attention is paid to reducing clock cycles for lower instruction latency, as well as to taking advantage of high-speed components in an attempt to reach a clock speed of at least 100 MHz. The final results allowed the CPU to run at over 200 MHz with a very reasonable chip area of around 900,000 nm².

Synthesis and Simulation of a 32Bit MIPS RISC Processor using VHDL

Conference Paper

Aug 2014

The main objective of the project is to design and simulate a 32-bit MIPS (Microprocessor without Interlocked Pipeline Stages) RISC (Reduced Instruction Set Computer) processor using VHDL (Very High Speed Integrated Circuit Hardware Description Language). In this paper, we analyze the instruction fetch module, the decoder module and the execution module, which includes a 32-bit floating-point ALU, a 32-bit flag register, the MIPS instruction set and 32-bit general-purpose registers, together with the design theory of the 32-bit MIPS RISC processor. Furthermore, we use the pipeline concept, which involves the Instruction Fetch, Instruction Decode, Execution, Memory and Write Back modules of the MIPS RISC processor, based on the 32-bit MIPS instruction set, in a single clock cycle. All modules in the design are coded in VHDL, a language whose concept of concurrency copes well with the parallelism of digital hardware. Finally, synthesis and simulation of the design are done in the Xilinx 13.1i ISE simulator.

MIPS Processor With Reduced Dynamic Power

Article

Mar 2013

This paper proposes a five-stage pipelined MIPS processor architecture with reduced dynamic power and improved clock cycles per instruction and million instructions per second. NOP instructions are added to eliminate the hazards introduced in pipelined processors. NOP instructions do not contribute any useful work, so the power consumed while executing them is wasted. The proposed architecture uses a dual-write-port register file to support dual write-back operation, which reduces the number of NOP instructions in the pipeline and thereby reduces dynamic power. The processor architecture is described in Verilog and synthesized using Xilinx ISE 14.1.

Design of an academic microcontroller and its application to Authenticated Encryption

Conference Paper

Feb 2014

In this work we present the design and FPGA implementation of a 16-bit microcontroller. Since this microcontroller has academic purposes, its architecture is simple, complete and open. Its assembly language was also designed, and a translator program written in Python is provided. The arithmetic/logic unit includes only integer operations, and we add a carry-free multiplier to ease some computations needed in cryptographic and coding theory applications that require binary field arithmetic. To demonstrate its functionality we chose a non-trivial application: the implementation of two Authenticated Encryption schemes, CCM and GCM. Our design can be used in economy-range FPGAs such as Spartan 3 or faster ranges such as Virtex 5 or higher.
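Assuming the "carry-free multiplier" above refers to carry-less (polynomial) multiplication over GF(2), as the mention of binary field arithmetic suggests, the operation can be sketched as below. Partial products are combined with XOR instead of addition, so no carries propagate. The function name `clmul` is chosen for this example.

```python
def clmul(a, b):
    """Carry-less multiply: treat a and b as polynomials over GF(2)."""
    result = 0
    while b:
        if b & 1:
            result ^= a   # XOR replaces addition, so no carry is generated
        a <<= 1           # next partial product
        b >>= 1
    return result


print(bin(clmul(0b11, 0b11)))   # (x + 1)^2 = x^2 + 1
print(bin(clmul(0b101, 0b11)))  # (x^2 + 1)(x + 1) = x^3 + x^2 + x + 1
```

A field multiplication such as the one used in GCM would follow this product with a reduction modulo the field polynomial, which is not shown here.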

A VLSI Design approach for RISC based MIPS architecture

Conference Paper

Jan 2012

This paper describes the design and analysis of the functional units of a RISC-based MIPS architecture. The functional units include the instruction fetch unit, instruction decode unit, execution unit, data memory and control unit. The functions of these modules are implemented as a pipeline without any interlocks and are simulated successfully on Modelsim 6.3f and Xilinx 9.2i. The design also attempts to achieve high performance through the use of a simplified instruction set.

A Novel Approach for VHDL Implementation of Universal Line Encoder for Communication

Article

Jul 2010

This paper describes the implementation of various line coding schemes using VHDL on a single chip. The user can select one of them for the purposes of security and area optimization, and the chip can support communication in varying channel environments. The choice of line code depends on the presence or absence of a DC level, power spectral density, bandwidth, BER performance, ease of clock signal recovery, and the presence or absence of an inherent error detection property. The line encoding schemes used are Unipolar RZ and NRZ, Polar RZ and NRZ, AMI and Manchester coding. A select pin on the chip lets users choose any one of the line encoding techniques according to their requirements. The waveforms of the Universal Line Encoder are presented using Modelsim 6.4.
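Some of the line codes listed above can be sketched as simple functions from bits to signal levels. The level values (+1, 0, -1), the function names, and the Manchester polarity (the IEEE 802.3 convention, where a 1 is a low-to-high transition) are assumptions made here; the paper's VHDL encoder may use different conventions.

```python
def unipolar_nrz(bits):
    """1 -> high level, 0 -> zero level, held for the whole bit period."""
    return [1 if b else 0 for b in bits]


def polar_nrz(bits):
    """1 -> positive level, 0 -> negative level."""
    return [1 if b else -1 for b in bits]


def manchester(bits):
    """Two half-bit levels per bit (IEEE 802.3 convention assumed)."""
    out = []
    for b in bits:
        out += [-1, 1] if b else [1, -1]   # mid-bit transition carries the clock
    return out


def ami(bits):
    """Alternate Mark Inversion: 0 -> zero, successive 1s alternate in polarity."""
    out, level = [], 1
    for b in bits:
        if b:
            out.append(level)
            level = -level                 # next mark uses the opposite polarity
        else:
            out.append(0)
    return out


data = [1, 0, 1, 1, 0, 1]
print(unipolar_nrz(data))
print(polar_nrz(data))
print(manchester(data))
print(ami(data))
```

AMI has no DC component because the marks alternate in polarity, and Manchester guarantees a transition in every bit, which is what eases clock recovery.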

Design of FPGA based 8-bit RISC controller IP core using VHDL

Conference Paper

Full-text available

Dec 2012

This paper describes the design, development and implementation of an 8-bit RISC controller IP core. The controller has been designed using Very high speed integrated circuit Hardware Description Language (VHDL). The design constraints are speed, power and area. The controller is efficient for specific applications and suitable for small applications. This non-pipelined controller has four units: fetch, decode, execute and a stage control unit. It has built-in program and data memory, and four ports for communicating with other I/O devices. A hierarchical approach has been used: the basic units are modeled using behavioral programming and then combined using structural programming. The design has been implemented on an ALTERA STRATIX II FPGA.

Design, implementation, and evaluation of a low-complexity vector-core for executing scalar/vector instructions

Article

Jun 2013
J PARALLEL DISTR COM

Mostafa I. Soliman

This paper proposes a low-complexity vector-core called LcVc for executing both scalar and vector instructions on the same execution datapath. A unified register file in the decode stage is used for storing both scalar operands and vector elements. The execution stage accepts a new set of operands each cycle and produces a new result. Rather than issuing a vector instruction (1-D operations) as a whole, each vector operation is issued sequentially with the existing scalar issue hardware. In the first implementation of LcVc, all loads and stores of registers take place from the data cache in the memory access stage at a rate of one element per clock cycle. The complete design of the proposed LcVc processor is implemented using VHDL, targeting the Xilinx FPGA Spartan 3E, xc3s1600e-4-fg320 device. The total number of slices required for implementing LcVc is 1778, where the number of slice flip-flops is 538 and the number of 4-input LUTs is 3706: 1914 for logic and 1792 for RAMs. Moreover, the performance evaluation results show that the speedups of executing vector addition, vector scaling, SAXPY, and matrix–matrix multiplication on LcVc over scalar execution are 2.3, 2.5, 1.9, and 3, respectively. The hardware required to support the enhanced vector capability is insignificant (5%), which reduces the area per core and increases the number of cores available in a given chip area.

Performance enhancement of VHDL-AMS for DSP design

Conference Paper

Jan 2005

Tony Tsang

The behavioural modelling of VHDL-AMS is a key factor in the development of analog and mixed-signal designs for communication devices. This paper presents a framework for the development of programmable mixed-signal devices, which integrates both programmable analog and digital circuits. The framework uses a VHDL-AMS based language, called VHDL-AMS-RTS, to describe the real-time domain and stochastic behaviour to adapt the simulation and performance analysis. The real-time stochastic statements of VHDL-AMS-RTS are added to the VHDL-AMS, which include time ordering and time constraint, probabilistic behaviour and quantitative description of mixed-signal devices. With this behavioural modelling environment it is possible to predict and optimize the analog and digital hardware using simulation but with a lower computational time and cost. To demonstrate the usefulness of the framework, we apply it to the structural performance analysis of soft input soft output (SISO) module of turbo decoding.
