Figure 7 - available via license: Creative Commons Attribution-NonCommercial 4.0 International
Content may be subject to copyright.
Source publication
House prices have significant impact on people’s daily life, and it is essential for people to have fixed abode, to live, work and social prosperity and stability. Hence predicting House price is a meaningful and big challenge. To achieve this goal, we use California Census dataset in this project to how distinctive features (attributes) can make t...
Context in source publication
Context 1
... there are some ways to transfer the non-normal variable into normal ones such as logarithm transformation. Applying the logarithm on the house value, we then plot the histogram of the house price in Figure 7. We can see that the distribution is more like a normal distribution with 0 skewness. ...
Similar publications
Mismanagement of fresh water is a primary concern that negatively impacts agricultural productivity. Judicious use of water in agriculture is possible by estimating the optimal requirement. The present practice of estimating crop water requirements is using reference evapotranspiration (ET0) values, which is considered a standard method. Hence, pre...
Citations
... In Equation 6, is the actual value of the target variable, is the predicted value, and the ( ) is the mean of the actual values. ...
In this research, we introduce two new machine learning regression methods: the Ensemble Average and the Pipelined Model. These methods aim to enhance traditional regression analysis for predictive tasks and have undergone thorough evaluation across three datasets: Kaggle House Price, Boston House Price, and California Housing, using various performance metrics. The results consistently show that our models outperform existing methods in terms of accuracy and reliability across all three datasets. The Pipelined Model, in particular, is notable for its ability to combine predictions from multiple models, leading to higher accuracy and impressive scalability. This scalability allows for their application in diverse fields like technology, finance, and healthcare. Furthermore, these models can be adapted for real-time and streaming data analysis, making them valuable for applications such as fraud detection, stock market prediction, and IoT sensor data analysis. Enhancements to the models also make them suitable for big data applications, ensuring their relevance for large datasets and distributed computing environments. It’s important to acknowledge some limitations of our models, including potential data biases, specific assumptions, increased complexity, and challenges related to interpretability when using them in practical scenarios. Nevertheless, these innovations advance predictive modeling, and our comprehensive evaluation underscores their potential to provide increased accuracy and reliability across a wide range of applications. The results indicate that the proposed models outperform existing models in terms of accuracy and robustness for all three datasets. The source code can be found at https://huggingface.co/DebajyotyBanik/Ensemble-Pipelined-Regression/tree/main.