Conference PaperPDF Available

Automatic Data Extraction Based on Semiconductor Datasheet for Design Automation of Power Converters

Authors:
Automatic Data Extraction based on
Semiconductor Datasheet for Design Automation
of Power Converters
Fanghao Tian1, Diego Bernal Cobaleda1, Wilmar Martinez1
1 Dept of Electrical Engineering (ESAT), KU Leuven-Energyville, Diepenbeek-Genk, Belgium
E-mail: fanghao.tian@kuleuven.be
Abstract—The application of artificial intelligence (AI) to
design automation (DA) is a novel research area in power
electronics, owing to the complexity of power converter de-
sign, power loss modeling on magnetic and semiconductors,
and the large number of components to choose from. This
paper presents a tool for extracting dynamic and complex
nonlinear characteristics from semiconductor datasheets in
order to improve the power loss estimation model, resulting
in an optimal power converter design. An object recognition
neural network, CenterNet, is trained to extract figures
from datasheets. The dynamic data is then extracted from
figures utilizing Optical Character Recognition (OCR) and
morphological image processing techniques. Finally, the
acquired data is used to enhance the dynamic properties
of power switches and to develop a more accurate power
loss model for use as input to design automation tools.
KeywordsArtificial Intelligence, design automation, im-
age processing, power converters, optimization
I. INTRODUCTION
With the development of decarbonized systems, such
as electric cars, electric aviation, and photo-voltaic sys-
tem, the demand for power electronic converters has in-
creased significantly due to their widespread applications
[1]. Increasingly strict requirements for the design of
power converters in terms of efficiency, power density,
volume, and mass are proposed [2]. However, designing
optimal power converters is difficult due to the abundance
of available power electronics components [3]. Further-
more, as it is impossible to test each design experi-
mentally and choose the one with optimal performance,
preselection of devices and simulations of the possible
designs become necessary. Semiconductors simulations
are frequently based on a simplified model which does
not take into account all of the semiconductors’ dynamic
characteristics. Thus, it is critical to enhance the accuracy
of the semiconductor loss calculation model in order to
perform reliable power converter design optimization.
The study on information extraction has made some
progress as the deep learning technique develops. Clark
and Divvala proposed a method for detecting chunks of
body text and locating the figures or tables by reason-
ing about empty regions [4] [5]. Siegel et al. proposed
FigureSeer, an end-to-end framework for automatically
locating figures in academic papers, particularly those
with subfigures, and classifying them into figures in the
result section and other sections using Convolutional
Neural Networks (CNNs). Then, using Optical Charac-
ter Recognition (OCR) algorithms that recognize text
contained within figures such as titles and legends, all
information contained within the figures is extracted [6].
Another powerful tool named Diag2graph was proposed
in [7]. The same algorithm for figure localization as [6]
was used. Various components such as node, text, and
arrow were detected during the figure content analysis.
Additionally, their relations are analyzed. ChartOCR in
[8] is a universal model that extracts key point infor-
mation and determines the chart type at the same time,
then predicts the data range and extracts the text using
Microsoft OCR. The extraction of data from line charts
was studied in [9], while the segmentation of multiple
curves was explored in color plots.
There are few tools especially for data extraction from
power electronics datasheets. In comparison to standard
academic papers, datasheets have several distinguishing
characteristics. (1) There are intensive figures on some
pages. (2) Some figures are in grayscale while others
are in color. (3) The figures are line charts and usually
have grids. On this basis, a special object recognition
algorithm focusing on complex line charts in datasheets
is developed to extract data with high accuracy. This
paper focuses on creating an AI-based tool to extract
information from figures of semiconductors datasheets to
develop a method to improve semiconductors power loss
calculation modeling for simulation in power converter
design automation.
The paper is organized as follows. In section II, A
CenterNet-based object detection algorithm is utilized to
detect and separate the figures from datasheets. In section
III, the same neural network with differently trained
weight parameters is implemented for key elements de-
tection of the figures. Combined with the results of text
recognition by OCR, data from line charts are extracted.
Then, the power loss model of power switches is enriched
by the obtained dynamic data in section IV. The results
and conclusion are in section V and section VI separately.
II. FIGURE SE PARATION
A. Task Overview
To automate the data extraction process from
datasheets, it is necessary to separate the figures in the
datasheets prior to conducting figure analysis. Image
processing algorithms are used to extract data from each
separate figure. Object detection, text recognition, and
line data extraction are the primary tasks. The flowchart
is as shown in Fig.1.
Fig. 1: Overview of data extraction task.
Detailed procedures for figures separation will be de-
scribed as follows.
B. Dataset Construction
The positions of figures are recognized using a CNN-
based object detection algorithm. Therefore, pages of
transistor datasheets are collected to construct the dataset
for training by manually annotating original datasheet
pages. An example of the annotated page is as shown in
Fig.2, where figures on the datasheet page are annotated
with a figure label.
Fig. 2: Examples of annotated page [10].
Different manufacturers’ datasheets have a variety of
different styles. Notably, some pages contain six figures,
while others may only contain four. Additionally, figures
sometimes occupy only a portion of the page. To increase
the object detection algorithm’s universality, datasheets
from several different manufacturers are gathered. Differ-
ent datasheets from the same manufacturer, on the other
hand, have a similar layout. As a result, CNN training
requires only a small amount of training data.
C. CenterNet
Object detection is a challenging task in computer vi-
sion. Numerous algorithms have been developed, such as
YOLO, RCNN, Faster RCNN and etc [12]. CenterNet is
a novel anchor-free object detection algorithm with good
performance for object detection on multiple scales [11].
To begin, a multiple layer deep CNN network serves as
the backbone for feature extraction from original pictures.
Corner pooling and center pooling are implemented on
top of the backbone to obtain two corner heatmaps and a
center key point heatmap, which detect a pair of corners
and a center, respectively, thus determine the bounding
box for the object. The architecture of CenterNet is shown
in Fig.3.
Fig. 3: Architecture of CenterNet.
The choice of backbone CNN is determined by the
complexity of dataset. Due to the similarity of datasheets
pages and figures, ResNet50 is chosen to be the backbone
CNN [13].
D. Training
Following the preparation of the labeled dataset and
the CenterNet algorithm, the training for 100 epochs
begins. A frozen scheme is used to speed up the training
process. Firstly, a well-trained ResNet50 model by a large
classic dataset is loaded, the parameters of which stay
unchanged for the first 50 training epochs. The training
process is then continued with the ResNet50 parameters
unfrozen. The labeled pages are randomly divided into
training and validation dataset with percentages of 90%
and 10%, respectively.
III. DATA EXTRACTION
Individual figures are separated from datasheets and
analyzed to extract detailed information. Similar to the
figure separation task, object detection is the initial step
in locating and recognizing the critical elements of a
figure, such as the title, legend, coordinate origin corner,
(a) (b) (c)
Fig. 4: Line extraction process (a) original image. (b) binarized image. (c) line skeleton.
label, and values of x-axis and y-axis marks. Then text
recognition is required to identify text and numbers,
which are required to comprehend figure information.
Finally, the line data with correct coordinate values are
extracted.
A. Dataset Construction
To train the CenterNet for line chart recognition,
a dataset must be constructed by manually annotating
figures cut from transistor datasheets. Six key elements
are indicated, which are title, legend, label, value, corner,
and other (if exists). Similarly, in order to increase the
universality of the algorithm, figures in a variety of
styles from a variety of manufacturers are gathered. And
they are randomly divided into training and validation
datasets with percentages of 90% and 10%, respectively.
An example of the annotated figure is as shown in Fig.
5.
Fig. 5: Example of annotated figure [10].
The same CenterNet is trained by the line chart dataset
for 100 epochs.
B. Data Extraction
After recognizing the key elements of figures, data can
be extracted by text recognition and line data processing.
1) Text recognition: Optical character recognition
(OCR) has been developing dramatically recently. Tesser-
act, an open-source OCR engine, is utilized in this paper.
Combined with label information from object detection,
recognized texts are classified as title, label and etc.
2) Line data extraction: Prior to digitizing data in a
line chart, it is necessary to specify the coordinate infor-
mation along with the mark values. To begin, recognized
values are classified into x-axis and y-axis values. Then,
because the majority of datasheet figures contain grids,
the figure is binarized to detect vertical and horizontal
lines that correspond to mark values. In this way, an
accurate coordinate is established.
The following step is to remove the background grid
and extract data from individual lines. Since the lines
are generally thicker than the grid, morphological image
processing techniques are applied to eliminate the grid.
More specially, morphological closing, which is a dilation
operation followed by an erosion, is implemented to
remove the thin black grid on the background. Then the
skeleton of lines is extracted to reduce line width into a
single pixel, making data extraction easier.
Finally, for line charts with multiple lines, an addi-
tional step of data correlation is required. After collecting
all pixel coordinates, they are grouped according to their
x axis values, like [(xi, yi1),(xi, yi2),· · · ,(xi, yin)]. The
next dot can be estimated using the coordinates of the
previous two dots on a line as 1:
ˆyi+1 =yiyi1
xixi1
(xi+1 xi) + yi(1)
Then the dot closest to ˆyi+1 is considered to be the
correlated dot for this line. And to prevent incorrect
correlation or noise interference, the distance between
the estimated position and the closest dot is limited to
a specific range.
The process of converting the original figure to a
binarized figure for the purpose of matching mark values,
then to a line skeleton for the purpose of extracting
line data is illustrated in Fig. 4. The title and labels
are removed during the binarization process to eliminate
background noise.
IV. ANALYTICAL SEMICONDUCTOR POW ER L OS SE S
MOD EL
In the power electronics design process, the selection
of semiconductors components significantly impacts the
performance of power converters. Traditionally, engineers
tend to choose the components based on experiences since
It is impractical to manually inspect thousands of semi-
conductor components, and the market is filled up with
semiconductor switching devices with similar voltage and
current characteristics. As a result, additional time is
spent selecting a device. Additionally, the parameters and
variables of semiconductors are highly relevant. To be
able to determine which of them, or which combination of
them will improve the overall efficiency of the converter,
an accurate and fast model is required. However, dynamic
data from datasheets can improve the analytical power
loss model on field effect transistor (FET) to get a more
precise estimation model. Thus, power converter designs
can be thoroughly evaluated and optimized.
Fig. 6: FET capacitances.
Fig.6 Shows the representation of the intrinsic capac-
itance of a conventional FET. Where; input capacitance
Ciss=Cgd +Cg s, output capacitance Coss =Cds +Cgd, and
reverese transfer capacitance Crss=Cgd .
A. Power Loss model
Power losses on semiconductor switches, generally
consist of conduction losses, gate losses, switching losses
and reverse recovery losses. To estimate them, common
used calculation models are shown in (2)-(5) [15]. Addi-
tionally, when calculating power loss, the parameters in
these formulations are typically fixed to typical values
rather than dynamic data from datasheet plots, which
implies a more accurate behavior of the device. In Fig. 7
two screenshots from a power FET are shown, Fig. 7a
depicts the typical parameters usually taken for the design
while Fig. 7b depicts the behaviour of the dynamic
parameters depending on the operation point of the FET.
Pc=RDS(on)) IDRM S
2(2)
Pg=CissV2
GS fs(3)
Psw =VDS ID
2(tr+tf)fs(4)
Prr =CossV2
DS fs(5)
where trand tfare turning on and turning off time, which
can be calculated by formulations in [16].
(a)
(b)
Fig. 7: Parameters for power loss calculation from
datasheets (a) Typical parameter values (b) Dynamic
values [10].
B. Improved power loss model
As previously stated, dynamic data from datasheets
are extracted and stored, which can be used to augment
the model for calculating power loss. For conduction loss
formulation (2), conventional method is to choose a fixed
value for RDS(on). However, according to datasheets,
conduction resistance is a variable that is dependent
on the junction temperature. As a result, the specific
value of turn on resistance should be determined by the
temperature. A dynamic conduction loss map is shown
as Fig. 8 with junction temperature and drain current
changing between [0 150] and [0 50] separately.
For reverse recovery loss (5), the output capacitance
Coss is a variable depending on drain to source voltage
VDS as is shown in 7b. Instead of regarding Coss as a
constant, an improved calculation method is shown in (6).
The integration can be estimated by data from the line
chart.
P
rr =VDS fsZVDS
0
CossV dV (6)
As for the switching loss (4), similar dynamic data can
be implemented in the process of calculating trand tf.
Fig. 8: Conduction loss vs. RMS current and junction
temperature.
V. RE SU LTS
A total of 200 pictures for the figure separation task
and 300 pictures for the line data extraction task have
been trained and tested. As is shown in Fig.9, the training
loss reduces dramatically and stays at a small value. The
final training losses are 2.59 and 0.87 for both tasks, while
the validation losses are 2.15 and 0.57 separately.
(a)
(b)
Fig. 9: Training and validation losses for (a) figure
separation (b) line data extraction.
With the small training and validation losses, the
trained CenterNet can detect the key elements from a
datasheet page or line chart. Two examples of the results
are illustrated in Fig. 10a. Every figure on this page is
detected and selected by the red bounding boxes. With
the bounding box information, the figures can be cut
into individual figures. Additionally, for the line chart
data extraction task, 10b displays the critical elements
highlighted by bounding boxes and classified according to
the pre-defined classes. After applying the OCR for text
recognition and traditional image processing algorithms
for curve extraction, the dynamic data are extracted from
datasheets successfully.
(a)
(b)
Fig. 10: Object detection results for (a) figure separation
(b) line data extraction [10].
VI. CONCLUSIONS
Power loss simulation is critical for design automation
of power electronics. Semiconductor components like
power FETs have unavoidable power loss, especially at
high frequencies. Thus, an accurate and precise mathe-
matical model of semiconductor power losses can result
in the optimal selection of a semiconductor component
for a particular design requirement. By acquiring dynamic
data from datasheets, it is possible to accurately estimate
the power losses of a specific component under specified
operating conditions.
In this paper, a CenterNet based tool for data extrac-
tion of semiconductors datasheets is proposed. Firstly,
CenterNet was trained for two tasks, which are figure
separation from original datasheets and key elements
recognition for line charts. Secondly, data extraction
from line charts is accomplished using image processing
techniques. Finally, an improved dynamic power loss
model is explained, considering junction temperature for
conduction losses and dynamic intrinsic capacitance for
reverse recovery losses. The same method can be utilized
to enhance the switching loss model.
REFERENCES
[1] W. Martinez, C. Cortes, A. Bilal and J. Kyyra, ”Finite Element
Methods for Multi-objective optimization of a High Step-up Inter-
leaved Boost Converter,” in 2018 International Power Electronics
Conference (IPEC-Niigata 2018 ECCE Asia), pp. 2193-2198,
2018.
[2] W. Martinez, M. Yamamoto, J. Imaoka, F. Velandia and C. A.
Cortes, ”Efficiency optimization of a two-phase interleaved boost
DC-DC converter for Electric Vehicle applications,” in 2016
IEEE 8th International Power Electronics and Motion Control
Conference (IPEMC-ECCE Asia), pp. 2474-2480, 2016.
[3] Mouser Electronics. Accessed on: October 10, 2021. [Online].
Available: https://eu.mouser.com/c/semiconductors/discrete-
semiconductors.
[4] C. Clark and S. Divvala. ”Looking Beyond Text: Extracting
Figures, Tables, and Captions from Computer Science Papers,”
in AAAI Workshops at the Twenty-Ninth AAAI Conference on
Artificial Intelligence, pp. 2-8, 2015.
[5] C. Clark and S. Divvala. ”PDFFigures 2.0: Mining figures from
research papers,” in 2016 IEEE/ACM Joint Conference on Digital
Libraries, pp. 143-152, 2016.
[6] N. Siegel, Z. Horvitz, R. Levin, S. Divvala, and A. Farhadi, ”Fig-
ureseer: Parsing result-figures in research papers,” in European
Conference on Computer Vision, pp. 664–680, 2016.
[7] A. Roy, I. Akrotirianakis, A. Kannan, D. Fradkin, A. Canedo, K.
Koneripalli, and T. Kulahcioglu. ”Diag2graph: Representing Deep
Learning Diagrams in Research Papers as Knowledge Graphs, in
2020 IEEE International Conference on Image Processing (ICIP),
pp. 2581-2585, 2020.
[8] J. Luo, Z. Li, J. Wang and C. -Y. Lin, ”ChartOCR: Data
Extraction from Charts Images via a Deep Hybrid Framework,
in 2021 IEEE Winter Conference on Applications of Computer
Vision (WACV), pp. 1916-1924, 2021.
[9] S. Choudhury, S. Wang, P. Mitra, and C. Giles, ”Automated
data extraction from scholarly line graphs,” in Eleventh IAPR
International Workshop on Graphics Recognition (GREC), 2015.
[10] EPC Products datasheets. Accessed on: Febru-
ary 28, 2022. [Online]. Available: https://epc-
co.com/epc/Products/eGaNFETsandICs.aspx.
[11] K. Duan, S. Bai, L. Xie, H. Qi, Q. Huang and Q. Tian, ”Center-
Net: Keypoint Triplets for Object Detection, in 2019 IEEE/CVF
International Conference on Computer Vision (ICCV), pp. 6568-
6577, 2019.
[12] L. Jiao, F. Zhang, F. Liu, S. Yang, L. Li, Z. Feng, and R. Qu, A
Survey of Deep Learning-Based Object Detection”, IEEE Access,
vol. 7, pp. 128837–128868, 2019.
[13] K. He, X. Zhang, S. Ren and J. Sun, ”Deep Residual Learning for
Image Recognition,” 2016 IEEE Conference on Computer Vision
and Pattern Recognition (CVPR),pp. 770-778, 2016.
[14] R. Smith, ”An Overview of the Tesseract OCR Engine, Ninth
International Conference on Document Analysis and Recognition
(ICDAR 2007), pp. 629-633, 2007.
[15] Z. J. Shen, Y. Xiong, X. Cheng, Y. Fu and P. Kumar, ”Power
MOSFET Switching Loss Analysis: A New Insight,” Conference
Record of the 2006 IEEE Industry Applications Conference
Forty-First IAS Annual Meeting, pp. 1438-1442, 2006.
[16] Vishay Siliconix, Appl. Note AN608A, pp.1-6. Available:
https://www.vishay.com/docs/73217/an608a.pdf.
... The results of CenterNet's training and data extraction are examined in Section IV, followed by a case study of using the collected data for selection under various operating conditions in Section V. Concluding remarks are included in Section VI. The initial result of this work was presented in the 2022 International Power Electronics Conference (IPEC-Himeji 2022-ECCE Asia) [14]. ...
Article
Full-text available
Power electronics design automation, implementing Artificial Intelligence (AI) to optimize the design of power converters, has emerged as a novel research topic given the complexity of power converter design, whose key challenges include power loss modeling across the enormous number of available components. This paper proposes a novel end-to-end AI-based tool for extracting nonlinear dynamic properties from semiconductor datasheets, which can enhance the power loss estimation model and accelerate the optimal design of power converters. Firstly, thousands of images from power transistor datasheets are collected and annotated to construct a training database. Then, CenterNet, a neural network for image object detection, is trained for figure segmentation from datasheets and key element detection from figures. Optical Character Recognition (OCR) and morphological image processing techniques are utilized to extract the specific dynamic data. The results illustrate that the customized tool for power transistor device datasheets in this paper can accurately extract the data, significantly reducing the time consumption for transistor data collection and its characteristic modeling work, promising pathways to streamline and optimize power electronics design. The tool has been published online and is actively being updated and improved via http://www.powerbrain.ai.
... In the past ten years, more and more researches dedicate to establishing a component-database, realizing co-action of different design software, and exploring efficient work flow of converter design. Componentdatabases are released either solely as raw data [12], or as web applications with Graphical User Interface (GUI) such as the MagNet Database from Princeton University [13]. Software co-action has been progressively realized as well. ...
Article
Full-text available
Designing power electronics converters for harsh environments is challenging due to the absence of components' performance under harsh conditions, the frequent transition and data-passing among various software, and the time-consuming and computationally heavy work flow. This paper promotes using design automation to address the aforementioned design challenges. The implementations include public-accessible component databases, automated co-action among circuit simulators and finite element simulations to perform electrical, electromagnetic and thermal co-design, and finally an exclusionbased work flow with hierarchical computation to reduce computational load. The theorized framework is exemplified on designing a real world 175 ◦C 1.5kW Three-level Neutral-point-clamped dc-dc converter. A database containing the high-temperature characteristics of SiC MOSFETs and ferrites is established and shared through a web application with graphical user interface. In 310 min, the program, which includes computationally heavy 3D finite element simulation, delivers design output after evaluating the converter's electrical, electromagnetic and thermal performance under 10 million parameter sets. Finally, a 1.5kW dc-dc converter prototype is built and tested in 175 ◦C ambient temperature to verify the quality of the design output
... This is crucial as power losses are significantly influenced by the dynamic characteristics provided in datasheets. Fortunately, a tool proposed in [14] and presented online as PowerBrain [15] facilitates the aggregation of comprehensive datasheet data, thereby laying the groundwork for both loss estimation and comparative analyses of MOSFET devices from diverse manufacturers. ...
... Therefore, AI significantly enhances power converters, especially magnetic design, offering advantages in automation, accuracy, and efficiency. AI's applications range from automated circuit parameter design [26] and data extraction for design automation [27] to overall improvements in the design, control, and maintenance of power converters [10]. It particularly excels in optimizing power loss and magnetic design in complex systems like hybrid AC-DC microgrids [28], outperforming traditional methods by reducing human dependence and improving operational efficiency in various aspects of power electronics. ...
Article
Full-text available
The paper provides an overview of how Artificial Intelligence (AI) is applied in designing highfrequency magnetic components, primarily high-frequency inductors and transformers, for power electronics systems. Four categories of AI, including expert systems, fuzzy logic, metaheuristic methods, and machine learning techniques, are addressed. Firstly, AI models for estimating losses in high-frequency magnetic components are discussed. Subsequently, AI-based design methods in high-frequency inductors and transformers are observed. Then, AI tools applied to the automatic design of high-frequency magnetic components are introduced and compared. Drawing insights from an analysis of over 200 publications, the paper highlights significant advancements: the development of AI-driven models for precise loss estimation in highfrequency magnetic components, the application of AI in optimizing design configurations for the components, and the automation of design processes. These achievements demonstrate AI’s capability to enhance the efficiency, performance, and innovation in high-frequency magnetic component design, offering a roadmap for future research in power electronics systems.
... Furthermore, the compatibility with Steinmetz's equation is validated, as the iGSE for a sinewave input yields the same result. Note that the core loss results are very sensitive to the Steinmetz parameters, which are extracted using the data extraction tool of [18], [19] and kept constant in all models. ...
Article
Full-text available
The design of electronic circuits is critical for a wide range of applications, from the electri-fication of transportation to the Internet of Things (IoT). It demands substantial resources, is time-intensive, and can be highly intricate. Current design methods often lead to inefficiencies, prolonged design cycles, and susceptibility to human error. Advancements in artificial intelligence (AI) play a crucial role in power electronics design by increasing efficiency, promoting automation, and enhancing sustainability of electrical systems. Research has demonstrated the applications of AI in power electronics to enhance system performance, optimization, and control strategy using machine learning, fuzzy logic, expert systems, and metaheuristic methods. However, a review that includes the recent AI advancements and potential of large language models (LLMs) like generative pre-train transformers (GPT) has not been reported. This paper presents an overview of applications of AI in power electronics (PE) including the potential of LLMs. The influence of LLMs-AI on the design process of PE and future research directions is also highlighted. The development of advanced AI algorithms such as pre-train transformers, real-time implementations, interdisciplinary collaboration, and data-driven approaches are also discussed. The proposed LLMs-AI is used to design parameters of high-frequency wireless power transfer (HFWPT) using MATLAB as a first case study, and high-frequency alternating current (HFAC) inverter using PSIM as a second case study. The proposed LLM-AI driven design is verified based on a similar design reported in the literature and Wilcoxon signed-rank test was conducted to further validate the result. Results show that the LLM-AI driven design based on the OpenAI foundation model has the potential to streamline the design process of power electronics. These findings provide a good reference on the feasibility of LLMs-AI on power electronic design.
Article
Full-text available
Object detection is one of the most important and challenging branches of computer vision, which has been widely applied in people’s life, such as monitoring security, autonomous driving and so on, with the purpose of locating instances of semantic objects of a certain class. With the rapid development of deep learning algorithms for detection tasks, the performance of object detectors has been greatly improved. In order to understand the main development status of object detection pipeline thoroughly and deeply, in this survey, we analyze the methods of existing typical detection models and describe the benchmark datasets at first. Afterwards and primarily, we provide a comprehensive overview of a variety of object detection methods in a systematic manner, covering the one-stage and two-stage detectors. Moreover, we list the traditional and new applications. Some representative branches of object detection are analyzed as well. Finally, we discuss the architecture of exploiting these object detection methods to build an effective and efficient system and point out a set of development trends to better follow the state-of-the-art algorithms and further research.
Conference Paper
Full-text available
Power losses and their consequences in the addition of storage cells have the negative effect of decreasing the power density and the efficiency in Electric Vehicles. For this reason, an efficiency optimization methodology is required to help reduce that problem. Specifically, in the power converters that interface the storage unit with the electric motors and their inverters, an efficiency optimization is essential to reduce the power losses and thereby downsize the cooling components and the storage unit. In this work, the topology under evaluation is the two-phase interleaved boost converter using different magnetic components such as coupled and non-coupled inductors, which are topologies known as effective for high power density applications. This paper presents a methodology that optimizes the efficiency of the chosen topologies through a complete power loss modeling of each component. Next generation components such as Super Junction Mosfets, GaN and SiC diodes and Mosfets are compared to obtain the most efficient and suitable material to be implemented to the topologies, especially to the converter with coupled-inductor. Moreover, a design procedure is proposed to integrate the loss model and the characteristics of the selected components as the base to obtain the objective function, which is later solved using analytical calculations. Finally, the optimization methodology is validated by experimental tests.
Conference Paper
‘Which are the pedestrian detectors that yield a precision above 95 % at 25 % recall?’ Answering such a complex query involves identifying and analyzing the results reported in figures within several research papers. Despite the availability of excellent academic search engines, retrieving such information poses a cumbersome challenge today as these systems have primarily focused on understanding the text content of scholarly documents. In this paper, we introduce FigureSeer, an end-to-end framework for parsing result-figures, that enables powerful search and retrieval of results in research papers. Our proposed approach automatically localizes figures from research papers, classifies them, and analyses the content of the result-figures. The key challenge in analyzing the figure content is the extraction of the plotted data and its association with the legend entries. We address this challenge by formulating a novel graph-based reasoning approach using a CNN-based similarity metric. We present a thorough evaluation on a real-word annotated dataset to demonstrate the efficacy of our approach.
Conference Paper
Figures and tables are key sources of information in many scholarly documents. However, current academic search engines do not make use of figures and tables when semantically parsing documents or presenting document summaries to users. To facilitate these applications we develop an algorithm that extracts figures, tables, and captions from documents called "PDFFigures 2.0." Our proposed approach analyzes the structure of individual pages by detecting captions, graphical elements, and chunks of body text, and then locates figures and tables by reasoning about the empty regions within that text. To evaluate our work, we introduce a new dataset of computer science papers, along with ground truth labels for the locations of the figures, tables, and captions within them. Our algorithm achieves impressive results (94% precision at 90% recall) on this dataset surpassing previous state of the art. Further, we show how our framework was used to extract figures from a corpus of over one million papers, and how the resulting extractions were integrated into the user interface of a smart academic search engine, Semantic Scholar (www.semanticscholar.org). Finally, we present results of exploratory data analysis completed on the extracted figures as well as an extension of our method for the task of section title extraction. We release our dataset and code on our project webpage for enabling future research (http://pdffigures2.allenai.org).
Article
Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers---8x deeper than VGG nets but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.