Hasan Ersan YAĞCIUnder-Sampling Methods for Imbalanced Data (ClusterCentroids, RandomUnderSampler, NearMiss)The imbalance of data is a big problem for classification tasks. In python, there is a library to allow to use of many algorithms to handle…6 min read·Jul 15, 2021----
Hasan Ersan YAĞCIYellowbrick; Machine Learning VisualizationVisualization is essential to make our analysis or modeling process understandable. We need visualization to see results or workflow…6 min read·Apr 19, 2021----
Hasan Ersan YAĞCIFeature Selection with BorutaPy, RFE and Univariate Feature SelectionFeature selection is one of the important part of the machine learning pipeline. We may have to struggle with a lot of features or useless…7 min read·Apr 4, 2021----
Hasan Ersan YAĞCIRandom Resampling Methods for Imbalanced Data with ImblearnWe want the data we prepared or analyzed for the model to be perfect. However, data may have missing values, outliers, complex data types…6 min read·Mar 25, 2021--5--5
Hasan Ersan YAĞCIFeature Scaling with Scikit-Learn for Data ScienceIn the data science process, we need to do some preprocessing before machine learning algorithms. These can be some basic data analysis…6 min read·Feb 22, 2021----
Hasan Ersan YAĞCIModel Deployment with Streamlit on AWS EC2We have already touched on the importance of model deployment and sharing this model with others. We need to share our model with…9 min read·Feb 12, 2021--1--1
Hasan Ersan YAĞCIIntroduction to Streamlit for Machine Learning Web AppStreamlit is an open-source Python library that makes it easy to build beautiful custom web-apps for machine learning and data science.6 min read·Feb 5, 2021--1--1
Hasan Ersan YAĞCIEncoding with Pandas get_dummiesWe previously covered the issue of encoding and its importance. In short, machine learning models are mathematical models that use…6 min read·Jan 24, 2021--1--1
Hasan Ersan YAĞCILabel Encoding vs One Hot EncodingWe need numerical data in data science techniques such as machine learning and deep learning models. We start our analysis with…5 min read·Jan 18, 2021--1--1
Hasan Ersan YAĞCIDetecting and Handling Outliers with PandasData analysis is a long process. There are some steps to do this. First of all, we need to recognize the data. We have to know every…6 min read·Jan 15, 2021--9--9