S. Banerjee

University of California, Irvine, Irvine, CA, USA

Are you S. Banerjee?

Claim your profile

Publications (12)3.66 Total impact

  • Conference Proceeding: A comparative study between different structures of rail and actuator used in electromagnetic levitation systems
    S. Banerjee, P. Biswas, R. Bhaduri, P. Sarkar
    [show abstract] [hide abstract]
    ABSTRACT: In this paper an analysis and design of different structure of rail (guide-way) and electromagnetic levitation system (EMLS) has been performed utilizing ANSYS software. For the successful implementation of any EMLS the proper selection and design of actuator and guide-way is important. The design of actuator is primarily controlled by the input power to lift power ratio and lift power magnet weight ratio [5]. These factors are dependent on the magnet dimensions, required gap flux and hence the required current density in the winding. The magnet configurations chosen on the basis of required pole-face area and necessary window area to house the excitation coils. There are various magnet and rail geometries; i.e. magnet with I, U and E profiles and various winding arrangements with flat and U-profile rail. In this work a FEM based analysis has done to find out the flux pattern, working flux density, field intensity, force etc. for different single actuator based levitation system. The main objective is to propose a suitable configuration of actuator and guide-rail for a specific DC electromagnetic levitation system..
    Power Electronics, Machines and Drives (PEMD 2010), 5th IET International Conference on; 05/2010
  • Conference Proceeding: Frequency response based dynamic performance analysis of switched mode power amplifiers used in electromagnetic levitation systems
    S. Banerjee, R. Bhaduri, P. Biswas
    [show abstract] [hide abstract]
    ABSTRACT: PSPICE is a general-purpose analog and digital electronic circuit simulator. It is a powerful program that is used in IC and board-level design to check the integrity of circuit designs and to predict circuit behaviour. The DC electromagnetic levitation system (EMLS) critically deserves very fast response and the DC-DC switch mode power amplifiers used for such applications have important role to meet this demand. The coil current for the magnets used in EMLS needs to be precisely controlled to meet the attractive force demand and this calls for a fast DC to DC power amplifier that can be controlled in a closed loop fashion. The main objective of this paper is to compare the dynamic performances between these different power amplifiers so that an insight can be made about the selection of power amplifier where speed of response and bandwidth is the prime criterion. The effect of different parameters of chopper amplifier (input DC link voltage, duty cycle and switching frequency) and levitation system (resistance and inductance of magnet coil) on the dynamic responses of amplifiers also been observed both by time and frequency response analysis utilizing PSPICE. This will lead to an inference for the design and development of chopper amplifier and DC EMLS. Experimental results to show the difference in dynamic responses between selective power circuits have also been presented.
    Power Electronics, Machines and Drives (PEMD 2010), 5th IET International Conference on; 05/2010
  • Source
    Article: Exploiting Application Data-Parallelism on Dynamically Reconfigurable Architectures: Placement and Architectural Considerations
    S. Banerjee, E. Bozorgzadeh, N. Dutt
    [show abstract] [hide abstract]
    ABSTRACT: Partial dynamic reconfiguration, often called run-time reconfiguration (RTR), is a key feature in modern reconfigurable platforms. In this paper, we present parallelism granularity selection (PARLGRAN), an application mapping approach that maximizes performance of application task chains on architectures with such capability. PARLGRAN essentially selects a suitable granularity of data-parallelism for individual data parallel tasks while considering key issues such as significant reconfiguration overhead and placement constraints. It integrates granularity selection very effectively in a joint scheduling and placement formulation, necessary due to constraints imposed by partial RTR. As a key step to validating PARLGRAN, we additionally present an exact strategy (integer linear programming formulation). We demonstrate that PARLGRAN generates high-quality schedules with: (1) a set of small test cases where we compare our results with the exact strategy; (2) a very large set of synthetic experiments with over a thousand data-points where we compare it with a simpler strategy that tries to statically maximize data-parallelism, i.e., only considers resource availability; and (3) a detailed application case study of JPEG encoding. The application case-study confirms that blindly maximizing data-parallelism can result in schedules even worse than that generated by a simple (but RTR-aware) approach oblivious to data-parallelism. Last, but very important, we demonstrate that our approach is well-suited for true on-demand computing with detailed execution time estimates on a typical embedded processor. Heuristic execution time is comparable to task execution time, i.e., it is feasible to integrate PARLGRAN in a run-time scheduler for dynamically reconfigurable architectures.
    IEEE Transactions on Very Large Scale Integration (VLSI) Systems 03/2009; · 1.22 Impact Factor
  • Source
    Conference Proceeding: Energy-aware co-processor selection for embedded processors on FPGAs
    [show abstract] [hide abstract]
    ABSTRACT: In this paper, we present co-processor selection problem for minimum energy consumption in hw/sw co-design on FPGAs with dual power mode. We provide theoretical analysis for the problem under no constraint, resource constraint, and timing constraint. We prove that the complexity of the problem in each case is NP-Hard and we provide a generalized ILP formulation. We compared the result of our approach in minimizing energy to the result of other approaches that had not considered both static and dynamic power during optimization and we showed that we can reduce energy by 63% in some cases.
    Computer Design, 2007. ICCD 2007. 25th International Conference on; 11/2007
  • Article: Integrating Physical Constraints in HW-SW Partitioning for Architectures With Partial Dynamic Reconfiguration
    S. Banerjee, E. Bozorgzadeh, N. D. Dutt
    [show abstract] [hide abstract]
    ABSTRACT: Partial dynamic reconfiguration is a key feature of modern reconfigurable architectures such as the Xilinx Virtex series of devices. However, this capability imposes strict placement constraints such that even exact system-level partitioning (and scheduling) formulations are not guaranteed to be physically realizable due to placement infeasibility. We first present an exact approach for hardware-software (HW-SW) partitioning that guarantees correctness of implementation by considering placement implications as an integral aspect of HW-SW partitioning. Our exact approach is based on integer linear programming (ILP) and considers key issues such as configuration prefetch for minimizing schedule length on the target single-context device. Next, we present a physically aware HW-SW partitioning heuristic that simultaneously partitions, schedules, and does linear placement of tasks on such devices. With the exact formulation, we confirm the necessity of physically-aware HW-SW partitioning for the target architecture. We demonstrate that our heuristic generates high-quality schedules by comparing the results with the exact formulation for small tests and with a popular, but placement-uanaware scheduling heuristic for a large set of over a hundred tests. Our final set of experiments is a case study of JPEG encoding-we demonstrate that our focus on physical considerations along with our consideration of multiple task implementation points enables our approach to be easily extended to handle heterogenous architectures (with specialized resources distributed between general purpose programmable logic columns). The execution time of our heuristic is very reasonable-task graphs with hundreds of nodes are processed (partitioned, scheduled, and placed) in a couple of minutes
    IEEE Transactions on Very Large Scale Integration (VLSI) Systems 12/2006; · 1.22 Impact Factor
  • Source
    Conference Proceeding: Design space exploration of real-time multi-media MPSoCs with heterogeneous scheduling policies
    [show abstract] [hide abstract]
    ABSTRACT: Real-time multi-media applications are increasingly being mapped onto MPSoC (multi-processor system-on-chip) platforms containing hardware-software IPs (intellectual property) along with a library of common scheduling policies such as EDF, RM. The choice of a scheduling policy for each IP is a key decision that greatly affects the design's ability to meet real-time constraints, and also directly affects the energy consumed by the design. We present a cosynthesis framework for design space exploration that considers heterogenous scheduling while mapping multimedia applications onto such MPSoCs. In our approach, we select a suitable scheduling policy for each IP such that system energy is minimized - our framework also includes energy reduction techniques utilizing dynamic power management. Experimental results on a realistic multi-mode multi-media terminal application demonstrate that our approach enables us to select design points with up to 60.5% reduced energy for a given area constraint, while meeting all real-time requirements. More importantly, our approach generates a tradeoff space between energy and cost allowing designers to comparatively evaluate multiple system level mappings.
    Hardware/software codesign and system synthesis, 2006. CODES+ISSS '06. Proceedings of the 4th international conference; 11/2006
  • Source
    Article: ISEGEN: an iterative improvement-based ISE generation technique for fast customization of processors
    [show abstract] [hide abstract]
    ABSTRACT: Customization of processor architectures through instruction set extensions (ISEs) is an effective way to meet the growing performance demands of embedded applications. A high-quality ISE generation approach needs to obtain results close to those achieved by experienced designers, particularly for complex applications that exhibit regularity: expert designers are able to exploit manually such regularity in the data flow graphs to generate high-quality ISEs. In this paper, we present ISEGEN, an approach that identifies high-quality ISEs by iterative improvement following the basic principles of the well-known Kernighan-Lin min-cut heuristic. Experimental results on a number of MediaBench, EEMBC, and cryptographic applications show that our approach matches the quality of the optimal solution obtained by exhaustive search. We also show that our ISEGEN technique is on average 20times faster than a genetic formulation that generates equivalent solutions. Furthermore, the ISEs identified by our technique exhibit 35% more speedup than the genetic solution on a large cryptographic application by effectively exploiting its regular structure
    IEEE Transactions on Very Large Scale Integration (VLSI) Systems 08/2006; · 1.22 Impact Factor
  • Source
    Conference Proceeding: PARLGRAN: parallelism granularity selection for scheduling task chains on dynamically reconfigurable architectures
    S. Banerjee, E. Bozorgzadeh, N. Dutt
    [show abstract] [hide abstract]
    ABSTRACT: Partial dynamic reconfiguration, often called RTR (run-time reconfiguration) is a key feature in modern reconfigurable platforms. While partial RTR enables additional application performance, it imposes physical constraints necessitating simultaneous scheduling and placement while mapping application task graphs onto such architectures. In this paper, we present PARLGRAN, an approach that maximizes performance of application task chains by selecting a suitable granularity of data-parallelism for individual data parallel tasks. Our approach focuses on reconfiguration delay overhead and placement-related issues (such as fragmentation) while selecting individual data-parallelism granularity as an integral part of simultaneous scheduling and placement. We demonstrate that our heuristic generates high-quality schedules on an extensive set of over a 1000 synthetic experiments by comparing the results with an approach that tries to statically maximize data-parallelism, i.e., does not consider the overheads and constraints associated with partial RTR. A detailed case-study on JPEG encoding additionally confirms that blindly maximizing data-parallelism can result in schedules even worse than that generated by a simple (but RTR-aware) approach oblivious to data-parallelism.
    Design Automation, 2006. Asia and South Pacific Conference on; 02/2006
  • Source
    Conference Proceeding: Performance and energy benefits of instruction set extensions in an FPGA soft core
    [show abstract] [hide abstract]
    ABSTRACT: Performance of applications can be boosted by executing application-specific instruction set extensions (ISEs) on a specialized hardware coupled with a processor core. Many commercially available customizable processors have communication overheads in their interface with the specialized hardware. However, existing ISE generation approaches have not considered customizable processors that have communication overheads at their interface. Furthermore, they have not characterized the energy benefits of such ISEs. We present a soft-processor customization framework that takes an input 'C' application and realizes a customized processor capturing the microarchitectural details of its interface with the specialized unit. We are able to accurately measure the speedup, energy, power and code size benefits of our ISE approach on a real system implementation by applying the design flow to a popular Xilinx Microblaze soft-processor core synthesized for four real-life applications. We show that only one large ISE per application is sufficient to get an average 1.41× speedup over pure software execution in spite of incurring communication overheads in the ISE implementation. We also observe a simultaneous savings in energy (up to 40%) and power (up to 12% peak power reduction) with this increased performance.
    VLSI Design, 2006. Held jointly with 5th International Conference on Embedded Systems and Design., 19th International Conference on; 02/2006
  • Source
    Conference Proceeding: Physically-aware HW-SW partitioning for reconfigurable architectures with partial dynamic reconfiguration
    S. Banerjee, E. Bozorgzadeh, N. Dutt
    [show abstract] [hide abstract]
    ABSTRACT: Many reconfigurable architectures offer partial dynamic configurability, but current system-level tools cannot guarantee feasible implementations when exploiting this feature. We present a physically aware hardware-software (HW-SW) scheme for minimizing application execution time under HW resource constraints, where the HW is a reconfigurable architecture with partial dynamic reconfiguration capability. Such architectures impose strict placement constraints that lead to implementation infeasibility of even optimal scheduling formulations that ignore the nature of these constraints. We propose an exact and a heuristic formulation that simultaneously partition, schedule, and do linear placement of tasks on such architectures. With our exact formulation, we prove the critical nature of placement constraints. We demonstrate that our heuristic generates high-quality schedules by comparing the results with the exact formulation for small tests and a popular, but placement-unaware scheduling heuristic for larger tests. With a case study, we demonstrate extension of our approach to handle heterogenous architectures with specialized resources distributed between general purpose programmable logic columns. The execution time of our heuristic is very reasonable - task graphs with hundreds of nodes are processed in a couple of minutes.
    Design Automation Conference, 2005. Proceedings. 42nd; 07/2005
  • Conference Proceeding: Considering run-time reconfiguration overhead in task graph transformations for dynamically reconfigurable architectures
    S. Banerjee, E. Bozorgzadeh, N. Dutt
    [show abstract] [hide abstract]
    ABSTRACT: In modern dynamic FPGA-based platforms where multiple processes may be executing concurrently, partial dynamic reconfiguration (RTR) is a key technique for maximizing application performance under resource constraints. For platforms with column-based partial RTR, we propose a new technique to statically transform linear task graphs (common in image processing applications). In our approach, the granularity of data parallelism for each task is determined while considering the reconfiguration overhead along with architectural constraints imposed by partial RTR. On JPEG applications, our technique can improve the execution time by up to 37% by choosing the right granularity of task parallelism.
    Field-Programmable Custom Computing Machines, 2005. FCCM 2005. 13th Annual IEEE Symposium on; 05/2005
  • Conference Proceeding: Efficient search space exploration for HW-SW partitioning
    S. Banerjee, N. Dutt
    [show abstract] [hide abstract]
    ABSTRACT: Hardware/software (HW-SW) partitioning is a key problem in the codesign of embedded systems, studied extensively in the past. One major open challenge for traditional partitioning approaches - as we move to more complex and heterogeneous SoCs - is the lack of efficient exploration of the large space of possible HW/SW configurations, coupled with the inability to efficiently scale up with larger problem sizes. We make two contributions for HW-SW partitioning of applications represented as procedural call-graphs: 1) we prove that during partitioning, the execution time metric for moving a vertex needs to be updated only for the immediate neighbours of the vertex, rather than for all ancestors along paths to the root vertex; consequently, we observe faster run-times for move-based partitioning algorithms such as simulated annealing (SA), allowing call graphs with thousands of vertices to be processed in less than a second, and 2) we devise a new cost function for SA that allows frequent discovery of better partitioning solutions by searching spaces overlooked by traditional SA cost functions. We present experimental results on a very large design space, where several thousand configurations are explored in minutes as compared to several hours or days using a traditional SA formulation. Furthermore, our approach is frequently able to locate better design points with over 10 % improvement in application execution time compared to the solutions generated by a Kernighan-Lin partitioning algorithm starting with an all-SW partitioning.
    Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS 2004. International Conference on; 10/2004