Data Analysis
1. Crawler preparation 1.1. Basic Python syntax This article assumes that you know basic Python syntax and how to install third-party Python libraries. With that in place, we can continue. 1.2. The overall idea of crawling a web p...
Posted by Techissue2008 on Sun, 16 May 2021 03:20:19 +0930
1. Understanding the problem Competition page: https://tianchi.aliyun.com/competition/entrance/231702/introduction?spm=5176.12281973.1005.6.55971f54Zaicht The competition uses the results of a questionnaire survey released as open data. Multiple groups of variables are selected, including individual varia...
Posted by Jona on Wed, 19 May 2021 04:54:04 +0930
The main difference between this article and others on the web is that the data analysis is done separately with BI, MySQL, Python, and Excel. 1. BI (visual display, PC and mobile) The following four charts are combined into one view, and their data can be linked (filter criteria, different comp...
Posted by dml on Thu, 20 May 2021 05:26:00 +0930
Official account: Euler's cabin. Author: Peter. Editor: Peter. Hello, my name is Peter. Today we continue with Pandas techniques for selecting data. The methods highlighted in this article are: the expression functions query and eval, plus filter, where, and mask. Further reading For more pandas articles, see: 1. DataFrame Da...
Posted by kyleldi on Wed, 26 May 2021 02:35:32 +0930
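As a rough illustration of the selection methods named in the Pandas entry above (query, eval, filter, where, mask), here is a minimal sketch. The DataFrame, its column names, and the thresholds are made up for this example and do not come from the article.

```python
import pandas as pd

# Hypothetical sample data, invented for illustration only.
df = pd.DataFrame({
    "name": ["a", "b", "c", "d"],
    "score": [55, 72, 90, 64],
    "age": [20, 31, 27, 45],
})

# query: select rows with a boolean expression written as a string.
passed = df.query("score >= 60 and age < 40")

# eval: evaluate an expression string against the columns; returns a Series here.
ratio = df.eval("score / age")

# filter: select columns by label (it does not filter on values).
score_only = df.filter(items=["name", "score"])

# where: keep values where the condition is True, replace the rest with NaN.
kept = df["score"].where(df["score"] >= 60)

# mask: the inverse of where; replace values where the condition is True.
masked = df["score"].mask(df["score"] >= 60)

print(passed)
print(kept)
print(masked)
```

Note that filter works on labels while query, where, and mask work on values, which is an easy distinction to miss given the names.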
# Import third-party libraries import numpy as np import pandas as pd import matplotlib.pyplot as plt import datetime import warnings import re from datetime import datetime from pyecharts.charts import Bar,Line,Map,Page,Pie from pyecharts import options as opts from pyecharts.globals import Symbo...
Posted by nariman on Wed, 26 May 2021 07:29:05 +0930
1. Getting web pages 1. Open a fund web page First we open Tiantian Fund Network, then we open a fund page at random. For example, 161725 China Merchants CSI Liquor Index. Link: http://fund.eastmoney.com/161725.html. We found that the web address of Tiantian Fund consists of http://fun...
Posted by Ravenous on Sat, 29 May 2021 08:24:04 +0930
Introduction This article introduces several simple ways to simulate a user login in everyday crawler work. Logging in directly with known cookies Principle In short, a cookie is stored on the client that initiated the request, and the server uses the cookie to distinguish different clie...
Posted by oeb on Sun, 30 May 2021 04:09:07 +0930
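The idea in the login entry above, reusing cookies you already have so the server treats the request as coming from a logged-in client, might look roughly like the sketch below. It uses only the standard library; the URL, cookie names, and cookie values are placeholders, not values from the article, and the request is built but not sent.

```python
from urllib.request import Request


def build_request_with_cookies(url, cookies):
    """Build a request that carries known cookies in its Cookie header."""
    # Cookies are sent as a single header of "name=value" pairs joined by "; ".
    cookie_header = "; ".join(f"{name}={value}" for name, value in cookies.items())
    return Request(url, headers={"Cookie": cookie_header, "User-Agent": "Mozilla/5.0"})


# Placeholder cookies: in practice these would be copied from a browser
# session in which you are already logged in.
known_cookies = {"sessionid": "PLACEHOLDER", "csrftoken": "PLACEHOLDER"}

# Hypothetical URL, used only for illustration.
req = build_request_with_cookies("https://example.com/profile", known_cookies)
print(req.get_header("Cookie"))

# To actually send it: urllib.request.urlopen(req)
```

Because such cookies expire, this approach usually only works for the lifetime of the session they were copied from.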
Contents 7. Using the correlation coefficient to judge the correlation of data 1. Using the correlation coefficient to judge the correlation of data 2. Finding the correlation between a single variable and the other variables 8. Using variance analysis to compare differences in data 1. Variance analys...
Posted by deed02392 on Wed, 02 Jun 2021 04:45:28 +0930
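For the correlation entry above, a minimal sketch of "the correlation between a single variable and the other variables" could look like this. It assumes the Pearson coefficient, which the excerpt does not specify, and the variable names and numbers are invented for the example.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and root sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if dev_x == 0 or dev_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (dev_x * dev_y)


# Hypothetical data, invented for illustration only.
data = {
    "hours": [1, 2, 3, 4, 5],
    "score": [2, 4, 6, 8, 10],
    "errors": [10, 8, 6, 4, 2],
}

# Correlate one target variable against every other variable.
target = "score"
corr_with_target = {
    name: pearson(data[target], values)
    for name, values in data.items()
    if name != target
}
print(corr_with_target)
```

Values near 1 or -1 indicate a strong linear relationship, and values near 0 indicate little linear relationship; the coefficient says nothing about non-linear dependence.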
news_analysis.ipynb import pandas as pd import jieba import numpy # pip install jieba Data source: http://www.sogou.com/labs/resource/ca.php df_news = pd.read_table('./data/val.txt', names=['category','theme','URL','content'], encoding='utf-8') # Name the columns df_news = df_news.dropna() # Remove miss...
Posted by jfgreco915 on Wed, 02 Jun 2021 05:20:03 +0930
Baidu Cloud link for the files used in this article: Link: https://pan.baidu.com/s/15-qbrbtRs4frup24Y1i5og Extraction code: pm2c What is the difference between linear fitting and linear prediction? In my opinion, first, the two have different application scenarios; second, the contents of...
Posted by bryson on Tue, 08 Jun 2021 07:10:01 +0930