DataWhale Integrated Learning Task13--Stacking Integrated Learning Algorithm

Based on the previous discussion of Blending's integrated learning algorithm, we know that Blending only uses data from the validation set in the integration process, which is actually a huge waste of data.To solve this problem, we analyze in detail where Blending is experiencing problems and hUTF-8...

Posted by curtm on Fri, 14 May 2021 02:49:37 +0930

python saves content to files (text, json, csv)

python saves content to files (text, json, csv) In the daily life of developers, saving data to a file is one of the most common programming tasks. Usually, a program needs some input and produces some output. In many cases, we want to persist these results. We may find ourselves saving data toUTF-8...

Posted by frans-jan on Fri, 14 May 2021 06:19:35 +0930

DataWhale integrated learning task 14: the prediction of happiness

Integrated learning case 1 (happiness prediction) Background Happiness is an old and profound topic, which is the direction of human pursuit for generations. There are thousands of factors related to well-being, which vary from person to person, such as the national economy and people's livelihUTF-8...

Posted by gbrown on Tue, 18 May 2021 04:10:22 +0930

Integrated learning case: happiness prediction

Datawhale open source project: machine learning, integrated learning and model fusion (based on python) [link] The content of this study is more complex, mainly to be familiar with the whole process and understand the meaning of each code sentence. Data information Based on 139 dimensional charUTF-8...

Posted by brianb on Wed, 19 May 2021 04:50:16 +0930

Case analysis of DataWhale integrated learning notes and happiness mining

1 topic understanding Title address: https://tianchi.aliyun.com/competition/entrance/231702/introduction?spm=5176.12281973.1005.6.55971f54Zaicht The results of questionnaire survey with open data are used in the competition. Multiple groups of variables are selected, including individual variaUTF-8...

Posted by Jona on Wed, 19 May 2021 04:54:04 +0930

Decision Tree Code - python Source Code, you can read it completely

decision tree Decision Tree is described in detail in Watermelon Book of Zhou Zhihua (Watermelon Book P73-P79), which is also the clearest explanation of decision tree that I have ever talked about. I will not make a fool of it here. This article is mainly about sharing the code of decision treUTF-8...

Posted by elmar on Fri, 21 May 2021 01:55:18 +0930

Today, on May 20, I forgot all my girlfriends and spent several hours sorting out the super detailed Java tail recursion.

1, Preface Tail call Wikipedia In computer science, tail call refers to the situation that the last action in a function returns the call result of a function, that is, the return value of the last new call is directly returned by the return result of the current function. In this case, the taiUTF-8...

Posted by Blulagoon on Fri, 21 May 2021 04:49:27 +0930

Using sklearn to realize SVM and data preprocessing

1. SVM algorithm Support vector machine (SVM) is a kind of generalized linear classifier which classifies data in the way of supervised learning. Its decision boundary is the maximum margin hyperplane of learning samples. SVM uses hinge loss function to calculate empirical risk, and adds regulaUTF-8...

Posted by Amitk on Sun, 23 May 2021 03:19:16 +0930

Some trick s of data feature Engineering

Data analysis (the following are all cases of overdue forecast of risk control) Link: https://tianchi.aliyun.com/notebook-ai/detail?spm=5176.12586969.1002.6.3b3022faB2aZ2J&postId=129320 1. Import the database needed for data analysis and visualization Here are some inline code slices. import pUTF-8...

Posted by richierich on Sun, 23 May 2021 04:05:15 +0930

Introduction to anomaly detection series (5. Anomaly detection of high dimensional data)

Introduction to anomaly detection series (5. Anomaly detection of high dimensional data) preface in real scenarios, many data sets are multi-dimensional. With the increase of dimensions, the size (volume) of the data space will grow exponentially, making the data become sparse, which is the proUTF-8...

Posted by anto on Mon, 24 May 2021 04:14:26 +0930