Dataset info
Number of variables | 121 |
---|---|
Number of observations | 10000 |
Missing cells | 288213 (23.8%) |
Duplicate rows | 0 (0.0%) |
Total size in memory | 9.2 MiB |
Average record size in memory | 968.0 B |
Variables types
Numeric | 33 |
---|---|
Categorical | 17 |
Boolean | 23 |
Date | 0 |
URL | 0 |
Text (Unique) | 0 |
Rejected | 48 |
Unsupported | 0 |
Warnings
AMT_GOODS_PRICE is highly correlated with AMT_CREDIT (ρ = 0.9880646871) | Rejected |
AMT_REQ_CREDIT_BUREAU_DAY has 1246 (12.5%) missing values | Missing |
AMT_REQ_CREDIT_BUREAU_HOUR has 1246 (12.5%) missing values | Missing |
AMT_REQ_CREDIT_BUREAU_MON has 1246 (12.5%) missing values | Missing |
AMT_REQ_CREDIT_BUREAU_QRT has 4864 (48.6%) zeros | Zeros |
AMT_REQ_CREDIT_BUREAU_QRT has 1246 (12.5%) missing values | Missing |
AMT_REQ_CREDIT_BUREAU_WEEK has 1246 (12.5%) missing values | Missing |
AMT_REQ_CREDIT_BUREAU_YEAR has 2267 (22.7%) zeros | Zeros |
AMT_REQ_CREDIT_BUREAU_YEAR has 1246 (12.5%) missing values | Missing |
APARTMENTS_AVG has 4874 (48.7%) missing values | Missing |
APARTMENTS_MEDI is highly correlated with APARTMENTS_AVG (ρ = 0.9972131584) | Rejected |
APARTMENTS_MODE is highly correlated with APARTMENTS_MEDI (ρ = 0.9817942349) | Rejected |
BASEMENTAREA_AVG has 444 (4.4%) zeros | Zeros |
BASEMENTAREA_AVG has 5686 (56.9%) missing values | Missing |
BASEMENTAREA_MEDI is highly correlated with BASEMENTAREA_AVG (ρ = 0.985925455) | Rejected |
BASEMENTAREA_MODE is highly correlated with BASEMENTAREA_MEDI (ρ = 0.9731043737) | Rejected |
CNT_CHILDREN has 7167 (71.7%) zeros | Zeros |
COMMONAREA_AVG has 274 (2.7%) zeros | Zeros |
COMMONAREA_AVG has 6893 (68.9%) missing values | Missing |
COMMONAREA_MEDI is highly correlated with COMMONAREA_AVG (ρ = 0.9956920971) | Rejected |
COMMONAREA_MODE is highly correlated with COMMONAREA_MEDI (ρ = 0.985031091) | Rejected |
DAYS_LAST_PHONE_CHANGE has 1170 (11.7%) zeros | Zeros |
DEF_30_CNT_SOCIAL_CIRCLE has 8891 (88.9%) zeros | Zeros |
DEF_60_CNT_SOCIAL_CIRCLE has 9185 (91.8%) zeros | Zeros |
ELEVATORS_AVG has 2806 (28.1%) zeros | Zeros |
ELEVATORS_AVG has 5172 (51.7%) missing values | Missing |
ELEVATORS_MEDI is highly correlated with ELEVATORS_AVG (ρ = 0.9956755484) | Rejected |
ELEVATORS_MODE is highly correlated with ELEVATORS_MEDI (ρ = 0.9866157093) | Rejected |
EMERGENCYSTATE_MODE has 4527 (45.3%) missing values | Missing |
ENTRANCES_AVG has 4830 (48.3%) missing values | Missing |
ENTRANCES_MEDI is highly correlated with ENTRANCES_AVG (ρ = 0.9968285611) | Rejected |
ENTRANCES_MODE is highly correlated with ENTRANCES_MEDI (ρ = 0.9800705269) | Rejected |
EXT_SOURCE_1 has 4171 (41.7%) missing values | Missing |
EXT_SOURCE_3 has 1813 (18.1%) missing values | Missing |
FLAG_DOCUMENT_10 has constant value "0" | Rejected |
FLAG_DOCUMENT_12 has constant value "0" | Rejected |
FLAG_DOCUMENT_13 has constant value "0" | Rejected |
FLAG_DOCUMENT_14 has constant value "0" | Rejected |
FLAG_DOCUMENT_15 has constant value "0" | Rejected |
FLAG_DOCUMENT_16 has constant value "0" | Rejected |
FLAG_DOCUMENT_17 has constant value "0" | Rejected |
FLAG_DOCUMENT_19 has constant value "0" | Rejected |
FLAG_DOCUMENT_2 has constant value "0" | Rejected |
FLAG_DOCUMENT_20 has constant value "0" | Rejected |
FLAG_DOCUMENT_21 has constant value "0" | Rejected |
FLAG_DOCUMENT_7 has constant value "0" | Rejected |
FLAG_MOBIL has constant value "1" | Rejected |
FLOORSMAX_AVG has 4770 (47.7%) missing values | Missing |
FLOORSMAX_MEDI is highly correlated with FLOORSMAX_AVG (ρ = 0.9954940358) | Rejected |
FLOORSMAX_MODE is highly correlated with FLOORSMAX_MEDI (ρ = 0.9904378273) | Rejected |
FLOORSMIN_AVG has 6702 (67.0%) missing values | Missing |
FLOORSMIN_MEDI is highly correlated with FLOORSMIN_AVG (ρ = 0.9971154706) | Rejected |
FLOORSMIN_MODE is highly correlated with FLOORSMIN_MEDI (ρ = 0.988324407) | Rejected |
FONDKAPREMONT_MODE has 6732 (67.3%) missing values | Missing |
HOUSETYPE_MODE has 4826 (48.3%) missing values | Missing |
LANDAREA_AVG has 531 (5.3%) zeros | Zeros |
LANDAREA_AVG has 5792 (57.9%) missing values | Missing |
LANDAREA_MEDI is highly correlated with LANDAREA_AVG (ρ = 0.9803449477) | Rejected |
LANDAREA_MODE is highly correlated with LANDAREA_MEDI (ρ = 0.9902680947) | Rejected |
LIVINGAPARTMENTS_AVG is highly correlated with APARTMENTS_MODE (ρ = 0.941832096) | Rejected |
LIVINGAPARTMENTS_MEDI is highly correlated with LIVINGAPARTMENTS_AVG (ρ = 0.9973914912) | Rejected |
LIVINGAPARTMENTS_MODE is highly correlated with LIVINGAPARTMENTS_MEDI (ρ = 0.9700264425) | Rejected |
LIVINGAREA_AVG is highly correlated with LIVINGAPARTMENTS_MEDI (ρ = 0.9004271393) | Rejected |
LIVINGAREA_MEDI is highly correlated with LIVINGAREA_AVG (ρ = 0.9966418413) | Rejected |
LIVINGAREA_MODE is highly correlated with LIVINGAREA_MEDI (ρ = 0.9719932992) | Rejected |
NAME_TYPE_SUITE has 180 (1.8%) missing values | Missing |
NONLIVINGAPARTMENTS_AVG has 1804 (18.0%) zeros | Zeros |
NONLIVINGAPARTMENTS_AVG has 6859 (68.6%) missing values | Missing |
NONLIVINGAPARTMENTS_MEDI is highly correlated with NONLIVINGAPARTMENTS_AVG (ρ = 0.9355157411) | Rejected |
NONLIVINGAPARTMENTS_MODE is highly correlated with NONLIVINGAPARTMENTS_MEDI (ρ = 0.9507902268) | Rejected |
NONLIVINGAREA_AVG has 1936 (19.4%) zeros | Zeros |
NONLIVINGAREA_AVG has 5354 (53.5%) missing values | Missing |
NONLIVINGAREA_MEDI is highly correlated with NONLIVINGAREA_AVG (ρ = 0.994807585) | Rejected |
NONLIVINGAREA_MODE is highly correlated with NONLIVINGAREA_MEDI (ρ = 0.988191842) | Rejected |
OBS_30_CNT_SOCIAL_CIRCLE has 5371 (53.7%) zeros | Zeros |
OBS_60_CNT_SOCIAL_CIRCLE is highly correlated with OBS_30_CNT_SOCIAL_CIRCLE (ρ = 0.9990456696) | Rejected |
OCCUPATION_TYPE has 3162 (31.6%) missing values | Missing |
ORGANIZATION_TYPE has a high cardinality: 57 distinct values | Warning |
OWN_CAR_AGE has 6702 (67.0%) missing values | Missing |
REGION_RATING_CLIENT_W_CITY is highly correlated with REGION_RATING_CLIENT (ρ = 0.9323217636) | Rejected |
TOTALAREA_MODE is highly correlated with LIVINGAREA_MODE (ρ = 0.9013424924) | Rejected |
WALLSMATERIAL_MODE has 4882 (48.8%) missing values | Missing |
YEARS_BEGINEXPLUATATION_AVG has 4656 (46.6%) missing values | Missing |
YEARS_BEGINEXPLUATATION_MEDI is highly correlated with YEARS_BEGINEXPLUATATION_AVG (ρ = 0.9999688217) | Rejected |
YEARS_BEGINEXPLUATATION_MODE is highly correlated with YEARS_BEGINEXPLUATATION_MEDI (ρ = 0.999758927) | Rejected |
YEARS_BUILD_AVG is highly correlated with YEARS_BEGINEXPLUATATION_MODE (ρ = 0.9612937056) | Rejected |
YEARS_BUILD_MEDI is highly correlated with YEARS_BUILD_AVG (ρ = 0.9989002843) | Rejected |
YEARS_BUILD_MODE is highly correlated with YEARS_BUILD_MEDI (ρ = 0.992116652) | Rejected |
AMT_ANNUITY
Numeric
Distinct count | 3658 |
---|---|
Unique (%) | 36.6% |
Missing (%) | < 0.1% |
Missing (n) | 5 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 29415.78504 |
---|---|
Minimum | 2295 |
Maximum | 177826.5 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 2295 |
---|---|
5-th percentile | 9432 |
Q1 | 18130.5 |
Median | 26230.5 |
Q3 | 37388.25 |
95-th percentile | 57676.5 |
Maximum | 177826.5 |
Range | 175531.5 |
Interquartile range | 19257.75 |
Descriptive statistics
Standard deviation | 16023.69177 |
---|---|
Coef of variation | 0.5447310601 |
Kurtosis | 6.734197292 |
Mean | 29415.78504 |
MAD | 12064.57121 |
Skewness | 1.618184647 |
Sum | 294010771.5 |
Variance | 256758698 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
30838.5 | 53 | 0.5% | |
27652.5 | 52 | 0.5% | |
22977 | 52 | 0.5% | |
43659 | 48 | 0.5% | |
52452 | 43 | 0.4% | |
23107.5 | 41 | 0.4% | |
30951 | 41 | 0.4% | |
24696 | 39 | 0.4% | |
26838 | 37 | 0.4% | |
23539.5 | 36 | 0.4% | |
Other values (3647) | 9553 | 95.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
2295 | 1 | < 0.1% | |
3478.5 | 1 | < 0.1% | |
3685.5 | 2 | < 0.1% | |
3730.5 | 1 | < 0.1% | |
4113 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
177826.5 | 2 | < 0.1% | |
177696 | 1 | < 0.1% | |
176062.5 | 1 | < 0.1% | |
173704.5 | 2 | < 0.1% | |
173574 | 1 | < 0.1% |
AMT_CREDIT
Numeric
Distinct count | 1564 |
---|---|
Unique (%) | 15.6% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 519256.2064 |
---|---|
Minimum | 45000 |
Maximum | 2156400 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 45000 |
---|---|
5-th percentile | 118512 |
Q1 | 260640 |
Median | 450000 |
Q3 | 675000 |
95-th percentile | 1272419.325 |
Maximum | 2156400 |
Range | 2111400 |
Interquartile range | 414360 |
Descriptive statistics
Standard deviation | 368122.9614 |
---|---|
Coef of variation | 0.708942824 |
Kurtosis | 3.181682535 |
Mean | 519256.2064 |
MAD | 271531.341 |
Skewness | 1.624651469 |
Sum | 5192562064 |
Variance | 1.355145147e+11 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
450000 | 434 | 4.3% | |
225000 | 384 | 3.8% | |
675000 | 299 | 3.0% | |
360000 | 175 | 1.8% | |
900000 | 168 | 1.7% | |
260640 | 137 | 1.4% | |
135000 | 132 | 1.3% | |
296280 | 124 | 1.2% | |
270000 | 112 | 1.1% | |
539100 | 111 | 1.1% | |
Other values (1554) | 7924 | 79.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
45000 | 31 | 0.3% | |
49500 | 1 | < 0.1% | |
49752 | 21 | 0.2% | |
52128 | 24 | 0.2% | |
54000 | 7 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
2156400 | 12 | 0.1% | |
2140227 | 1 | < 0.1% | |
2085120 | 1 | < 0.1% | |
2013840 | 44 | 0.4% | |
2000061 | 1 | < 0.1% |
AMT_GOODS_PRICE
Highly correlated
This variable is highly correlated with AMT_CREDIT
and should be ignored for analysis
Correlation | 0.9880646871 |
---|
AMT_INCOME_TOTAL
Numeric
Distinct count | 253 |
---|---|
Unique (%) | 2.5% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 178153.1145 |
---|---|
Minimum | 26941.5 |
Maximum | 3150000 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 26941.5 |
---|---|
5-th percentile | 69727.5 |
Q1 | 112500 |
Median | 157500 |
Q3 | 225000 |
95-th percentile | 360000 |
Maximum | 3150000 |
Range | 3123058.5 |
Interquartile range | 112500 |
Descriptive statistics
Standard deviation | 100945.4722 |
---|---|
Coef of variation | 0.5666219896 |
Kurtosis | 93.29420592 |
Mean | 178153.1145 |
MAD | 67578.64978 |
Skewness | 5.091543665 |
Sum | 1781531145 |
Variance | 1.018998836e+10 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
135000 | 1168 | 11.7% | |
112500 | 1026 | 10.3% | |
157500 | 860 | 8.6% | |
180000 | 847 | 8.5% | |
225000 | 741 | 7.4% | |
202500 | 636 | 6.4% | |
90000 | 583 | 5.8% | |
270000 | 388 | 3.9% | |
67500 | 272 | 2.7% | |
315000 | 241 | 2.4% | |
Other values (243) | 3238 | 32.4% |
Minimum 5 values
Value | Count | Frequency (%) | |
26941.5 | 1 | < 0.1% | |
27000 | 1 | < 0.1% | |
28800 | 1 | < 0.1% | |
29250 | 1 | < 0.1% | |
30150 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
3150000 | 1 | < 0.1% | |
1800000 | 1 | < 0.1% | |
1575000 | 1 | < 0.1% | |
1125000 | 5 | 0.1% | |
945000 | 1 | < 0.1% |
AMT_REQ_CREDIT_BUREAU_DAY
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 12.5% |
Missing (n) | 1246 |
0 | |
---|---|
1 | 19 |
2 | 1 |
(Missing) | 1246 |
Value | Count | Frequency (%) | |
0 | 8734 | 87.3% | |
1 | 19 | 0.2% | |
2 | 1 | < 0.1% | |
(Missing) | 1246 | 12.5% |
Max length | 3 |
---|---|
Mean length | 3 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | True |
AMT_REQ_CREDIT_BUREAU_HOUR
Boolean
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 12.5% |
Missing (n) | 1246 |
0 | |
---|---|
1 | 16 |
(Missing) | 1246 |
Value | Count | Frequency (%) | |
0 | 8738 | 87.4% | |
1 | 16 | 0.2% | |
(Missing) | 1246 | 12.5% |
AMT_REQ_CREDIT_BUREAU_MON
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 12.5% |
Missing (n) | 1246 |
0 | |
---|---|
1 | 60 |
2 | 4 |
(Missing) | 1246 |
Value | Count | Frequency (%) | |
0 | 8690 | 86.9% | |
1 | 60 | 0.6% | |
2 | 4 | < 0.1% | |
(Missing) | 1246 | 12.5% |
Max length | 3 |
---|---|
Mean length | 3 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | True |
AMT_REQ_CREDIT_BUREAU_QRT
Numeric
Distinct count | 7 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 12.5% |
Missing (n) | 1246 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.5435229609 |
---|---|
Minimum | 0 |
Maximum | 5 |
Zeros (%) | 48.6% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 5 |
Range | 5 |
Interquartile range | 1 |
Descriptive statistics
Standard deviation | 0.6956572173 |
---|---|
Coef of variation | 1.27990401 |
Kurtosis | 1.902756075 |
Mean | 0.5435229609 |
MAD | 0.6039971857 |
Skewness | 1.27405145 |
Sum | 4758 |
Variance | 0.483938964 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 4864 | 48.6% | |
1 | 3166 | 31.7% | |
2 | 598 | 6.0% | |
3 | 110 | 1.1% | |
4 | 14 | 0.1% | |
5 | 2 | < 0.1% | |
(Missing) | 1246 | 12.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 4864 | 48.6% | |
1 | 3166 | 31.7% | |
2 | 598 | 6.0% | |
3 | 110 | 1.1% | |
4 | 14 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
5 | 2 | < 0.1% | |
4 | 14 | 0.1% | |
3 | 110 | 1.1% | |
2 | 598 | 6.0% | |
1 | 3166 | 31.7% |
AMT_REQ_CREDIT_BUREAU_WEEK
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 12.5% |
Missing (n) | 1246 |
0 | |
---|---|
1 | 23 |
2 | 1 |
(Missing) | 1246 |
Value | Count | Frequency (%) | |
0 | 8730 | 87.3% | |
1 | 23 | 0.2% | |
2 | 1 | < 0.1% | |
(Missing) | 1246 | 12.5% |
Max length | 3 |
---|---|
Mean length | 3 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | True |
AMT_REQ_CREDIT_BUREAU_YEAR
Numeric
Distinct count | 14 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 12.5% |
Missing (n) | 1246 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 1.969499657 |
---|---|
Minimum | 0 |
Maximum | 12 |
Zeros (%) | 22.7% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 2 |
Q3 | 3 |
95-th percentile | 6 |
Maximum | 12 |
Range | 12 |
Interquartile range | 3 |
Descriptive statistics
Standard deviation | 1.840890771 |
---|---|
Coef of variation | 0.9346997164 |
Kurtosis | 1.109571591 |
Mean | 1.969499657 |
MAD | 1.429623164 |
Skewness | 1.072093196 |
Sum | 17241 |
Variance | 3.388878831 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 2267 | 22.7% | |
1 | 1849 | 18.5% | |
2 | 1775 | 17.8% | |
3 | 1290 | 12.9% | |
4 | 709 | 7.1% | |
5 | 419 | 4.2% | |
6 | 217 | 2.2% | |
7 | 128 | 1.3% | |
8 | 66 | 0.7% | |
9 | 28 | 0.3% | |
Other values (3) | 6 | 0.1% | |
(Missing) | 1246 | 12.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 2267 | 22.7% | |
1 | 1849 | 18.5% | |
2 | 1775 | 17.8% | |
3 | 1290 | 12.9% | |
4 | 709 | 7.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
12 | 1 | < 0.1% | |
11 | 1 | < 0.1% | |
10 | 4 | < 0.1% | |
9 | 28 | 0.3% | |
8 | 66 | 0.7% |
APARTMENTS_AVG
Numeric
Distinct count | 796 |
---|---|
Unique (%) | 8.0% |
Missing (%) | 48.7% |
Missing (n) | 4874 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.122804604 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 0.3% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.0124 |
Q1 | 0.0619 |
Median | 0.0928 |
Q3 | 0.1485 |
95-th percentile | 0.345125 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.0866 |
Descriptive statistics
Standard deviation | 0.1116105768 |
---|---|
Coef of variation | 0.9088468442 |
Kurtosis | 9.886240471 |
Mean | 0.122804604 |
MAD | 0.07525685037 |
Skewness | 2.554551179 |
Sum | 629.4964 |
Variance | 0.01245692085 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.0825 | 218 | 2.2% | |
0.0619 | 207 | 2.1% | |
0.0928 | 183 | 1.8% | |
0.0722 | 146 | 1.5% | |
0.1485 | 113 | 1.1% | |
0.1031 | 100 | 1.0% | |
0.1237 | 92 | 0.9% | |
0.0082 | 89 | 0.9% | |
0.0124 | 80 | 0.8% | |
0.0165 | 74 | 0.7% | |
Other values (785) | 3824 | 38.2% | |
(Missing) | 4874 | 48.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 32 | 0.3% | |
0.001 | 10 | 0.1% | |
0.0021 | 19 | 0.2% | |
0.0031 | 13 | 0.1% | |
0.0037 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 3 | < 0.1% | |
0.9701 | 1 | < 0.1% | |
0.9577 | 1 | < 0.1% | |
0.8887 | 1 | < 0.1% | |
0.8691 | 1 | < 0.1% |
APARTMENTS_MEDI
Highly correlated
This variable is highly correlated with APARTMENTS_AVG
and should be ignored for analysis
Correlation | 0.9972131584 |
---|
APARTMENTS_MODE
Highly correlated
This variable is highly correlated with APARTMENTS_MEDI
and should be ignored for analysis
Correlation | 0.9817942349 |
---|
BASEMENTAREA_AVG
Numeric
Distinct count | 1673 |
---|---|
Unique (%) | 16.7% |
Missing (%) | 56.9% |
Missing (n) | 5686 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.09208103848 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 4.4% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.0488 |
Median | 0.0784 |
Q3 | 0.113275 |
95-th percentile | 0.23137 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.064475 |
Descriptive statistics
Standard deviation | 0.08730425292 |
---|---|
Coef of variation | 0.9481241129 |
Kurtosis | 32.23789829 |
Mean | 0.09208103848 |
MAD | 0.05312190079 |
Skewness | 4.155863851 |
Sum | 397.2376 |
Variance | 0.007622032578 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 444 | 4.4% | |
0.0545 | 14 | 0.1% | |
0.0635 | 13 | 0.1% | |
0.1091 | 13 | 0.1% | |
0.0803 | 12 | 0.1% | |
0.0764 | 11 | 0.1% | |
0.0503 | 10 | 0.1% | |
0.0795 | 10 | 0.1% | |
0.1018 | 10 | 0.1% | |
0.0691 | 10 | 0.1% | |
Other values (1662) | 3767 | 37.7% | |
(Missing) | 5686 | 56.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 444 | 4.4% | |
0.0001 | 3 | < 0.1% | |
0.0002 | 1 | < 0.1% | |
0.0004 | 2 | < 0.1% | |
0.0005 | 2 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 8 | 0.1% | |
0.9636 | 1 | < 0.1% | |
0.9397 | 1 | < 0.1% | |
0.8246 | 2 | < 0.1% | |
0.8152 | 1 | < 0.1% |
BASEMENTAREA_MEDI
Highly correlated
This variable is highly correlated with BASEMENTAREA_AVG
and should be ignored for analysis
Correlation | 0.985925455 |
---|
BASEMENTAREA_MODE
Highly correlated
This variable is highly correlated with BASEMENTAREA_MEDI
and should be ignored for analysis
Correlation | 0.9731043737 |
---|
CNT_CHILDREN
Numeric
Distinct count | 7 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.3858 |
---|---|
Minimum | 0 |
Maximum | 8 |
Zeros (%) | 71.7% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 8 |
Range | 8 |
Interquartile range | 1 |
Descriptive statistics
Standard deviation | 0.6929692494 |
---|---|
Coef of variation | 1.79618779 |
Kurtosis | 4.706418144 |
Mean | 0.3858 |
MAD | 0.55300572 |
Skewness | 1.964242915 |
Sum | 3858 |
Variance | 0.4802063806 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 7167 | 71.7% | |
1 | 1962 | 19.6% | |
2 | 740 | 7.4% | |
3 | 116 | 1.2% | |
4 | 10 | 0.1% | |
5 | 4 | < 0.1% | |
8 | 1 | < 0.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 7167 | 71.7% | |
1 | 1962 | 19.6% | |
2 | 740 | 7.4% | |
3 | 116 | 1.2% | |
4 | 10 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
8 | 1 | < 0.1% | |
5 | 4 | < 0.1% | |
4 | 10 | 0.1% | |
3 | 116 | 1.2% | |
2 | 740 | 7.4% |
CNT_FAM_MEMBERS
Numeric
Distinct count | 8 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 2.1342 |
---|---|
Minimum | 1 |
Maximum | 10 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
Median | 2 |
Q3 | 2 |
95-th percentile | 4 |
Maximum | 10 |
Range | 9 |
Interquartile range | 0 |
Descriptive statistics
Standard deviation | 0.8769648036 |
---|---|
Coef of variation | 0.4109103194 |
Kurtosis | 1.846921188 |
Mean | 2.1342 |
MAD | 0.62314416 |
Skewness | 0.9865700905 |
Sum | 21342 |
Variance | 0.7690672667 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
2 | 5418 | 54.2% | |
1 | 2106 | 21.1% | |
3 | 1653 | 16.5% | |
4 | 697 | 7.0% | |
5 | 111 | 1.1% | |
6 | 10 | 0.1% | |
7 | 4 | < 0.1% | |
10 | 1 | < 0.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 2106 | 21.1% | |
2 | 5418 | 54.2% | |
3 | 1653 | 16.5% | |
4 | 697 | 7.0% | |
5 | 111 | 1.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
10 | 1 | < 0.1% | |
7 | 4 | < 0.1% | |
6 | 10 | 0.1% | |
5 | 111 | 1.1% | |
4 | 697 | 7.0% |
CODE_GENDER
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
F | |
---|---|
M |
Value | Count | Frequency (%) | |
F | 6708 | 67.1% | |
M | 3292 | 32.9% |
Max length | 1 |
---|---|
Mean length | 1 |
Min length | 1 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
COMMONAREA_AVG
Numeric
Distinct count | 1092 |
---|---|
Unique (%) | 10.9% |
Missing (%) | 68.9% |
Missing (n) | 6893 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.04834917927 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 2.7% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.0086 |
Median | 0.0231 |
Q3 | 0.05385 |
95-th percentile | 0.1737 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.04525 |
Descriptive statistics
Standard deviation | 0.08674752332 |
---|---|
Coef of variation | 1.794188125 |
Kurtosis | 45.14787836 |
Mean | 0.04834917927 |
MAD | 0.04574583521 |
Skewness | 5.663283057 |
Sum | 150.2209 |
Variance | 0.007525132803 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 274 | 2.7% | |
0.0077 | 20 | 0.2% | |
0.0079 | 19 | 0.2% | |
0.0078 | 16 | 0.2% | |
0.0087 | 14 | 0.1% | |
0.0086 | 14 | 0.1% | |
0.0121 | 14 | 0.1% | |
0.0014 | 13 | 0.1% | |
0.0124 | 13 | 0.1% | |
0.0118 | 12 | 0.1% | |
Other values (1081) | 2698 | 27.0% | |
(Missing) | 6893 | 68.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 274 | 2.7% | |
0.0001 | 2 | < 0.1% | |
0.0002 | 3 | < 0.1% | |
0.0003 | 3 | < 0.1% | |
0.0005 | 2 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 6 | 0.1% | |
0.8882 | 1 | < 0.1% | |
0.8876 | 1 | < 0.1% | |
0.7441 | 2 | < 0.1% | |
0.6761 | 1 | < 0.1% |
COMMONAREA_MEDI
Highly correlated
This variable is highly correlated with COMMONAREA_AVG
and should be ignored for analysis
Correlation | 0.9956920971 |
---|
COMMONAREA_MODE
Highly correlated
This variable is highly correlated with COMMONAREA_MEDI
and should be ignored for analysis
Correlation | 0.985031091 |
---|
DAYS_BIRTH
Numeric
Distinct count | 7409 |
---|---|
Unique (%) | 74.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | -16070.6283 |
---|---|
Minimum | -25087 |
Maximum | -7338 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | -25087 |
---|---|
5-th percentile | -23120.1 |
Q1 | -19627.5 |
Median | -15785.5 |
Q3 | -12530.5 |
95-th percentile | -9484.8 |
Maximum | -7338 |
Range | 17749 |
Interquartile range | 7097 |
Descriptive statistics
Standard deviation | 4309.713586 |
---|---|
Coef of variation | -0.2681733101 |
Kurtosis | -1.034573587 |
Mean | -16070.6283 |
MAD | 3673.514906 |
Skewness | -0.1035620769 |
Sum | -160706283 |
Variance | 18573631.19 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
-21503 | 6 | 0.1% | |
-15382 | 5 | 0.1% | |
-11876 | 5 | 0.1% | |
-14729 | 5 | 0.1% | |
-17875 | 5 | 0.1% | |
-13999 | 5 | 0.1% | |
-21152 | 5 | 0.1% | |
-15977 | 5 | 0.1% | |
-15251 | 4 | < 0.1% | |
-17716 | 4 | < 0.1% | |
Other values (7399) | 9951 | 99.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
-25087 | 1 | < 0.1% | |
-25021 | 1 | < 0.1% | |
-24985 | 1 | < 0.1% | |
-24975 | 1 | < 0.1% | |
-24852 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
-7338 | 1 | < 0.1% | |
-7725 | 1 | < 0.1% | |
-7742 | 1 | < 0.1% | |
-7756 | 1 | < 0.1% | |
-7954 | 1 | < 0.1% |
DAYS_EMPLOYED
Numeric
Distinct count | 4225 |
---|---|
Unique (%) | 42.2% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 66290.9107 |
---|---|
Minimum | -17124 |
Maximum | 365243 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | -17124 |
---|---|
5-th percentile | -6865.1 |
Q1 | -2941.25 |
Median | -1327 |
Q3 | -309 |
95-th percentile | 365243 |
Maximum | 365243 |
Range | 382367 |
Interquartile range | 2632.25 |
Descriptive statistics
Standard deviation | 143446.2907 |
---|---|
Coef of variation | 2.163890784 |
Kurtosis | 0.5745332818 |
Mean | 66290.9107 |
MAD | 111867.8718 |
Skewness | 1.604038566 |
Sum | 662909107 |
Variance | 2.057683832e+10 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
365243 | 1871 | 18.7% | |
-1342 | 10 | 0.1% | |
-1814 | 9 | 0.1% | |
-861 | 9 | 0.1% | |
-1919 | 8 | 0.1% | |
-1387 | 8 | 0.1% | |
-1027 | 8 | 0.1% | |
-386 | 8 | 0.1% | |
-989 | 8 | 0.1% | |
-1640 | 8 | 0.1% | |
Other values (4215) | 8053 | 80.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
-17124 | 1 | < 0.1% | |
-17077 | 1 | < 0.1% | |
-16774 | 1 | < 0.1% | |
-16034 | 1 | < 0.1% | |
-15708 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
365243 | 1871 | 18.7% | |
-1 | 1 | < 0.1% | |
-14 | 1 | < 0.1% | |
-26 | 1 | < 0.1% | |
-32 | 1 | < 0.1% |
DAYS_ID_PUBLISH
Numeric
Distinct count | 4514 |
---|---|
Unique (%) | 45.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | -3036.6527 |
---|---|
Minimum | -6317 |
Maximum | 0 |
Zeros (%) | < 0.1% |
Quantile statistics
Minimum | -6317 |
---|---|
5-th percentile | -5132.05 |
Q1 | -4438 |
Median | -3186 |
Q3 | -1701 |
95-th percentile | -362 |
Maximum | 0 |
Range | 6317 |
Interquartile range | 2737 |
Descriptive statistics
Standard deviation | 1573.295197 |
---|---|
Coef of variation | -0.5181017891 |
Kurtosis | -1.158299024 |
Mean | -3036.6527 |
MAD | 1372.307166 |
Skewness | 0.2719956427 |
Sum | -30366527 |
Variance | 2475257.776 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
-4215 | 11 | 0.1% | |
-4592 | 11 | 0.1% | |
-4255 | 10 | 0.1% | |
-4452 | 10 | 0.1% | |
-4570 | 10 | 0.1% | |
-4311 | 10 | 0.1% | |
-4521 | 10 | 0.1% | |
-4291 | 10 | 0.1% | |
-4324 | 10 | 0.1% | |
-4558 | 10 | 0.1% | |
Other values (4504) | 9898 | 99.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
-6317 | 1 | < 0.1% | |
-6315 | 1 | < 0.1% | |
-6293 | 1 | < 0.1% | |
-6279 | 1 | < 0.1% | |
-6272 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0 | 2 | < 0.1% | |
-1 | 2 | < 0.1% | |
-3 | 1 | < 0.1% | |
-4 | 1 | < 0.1% | |
-5 | 3 | < 0.1% |
DAYS_LAST_PHONE_CHANGE
Numeric
Distinct count | 2824 |
---|---|
Unique (%) | 28.2% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | -1087.4316 |
---|---|
Minimum | -4285 |
Maximum | 0 |
Zeros (%) | 11.7% |
Quantile statistics
Minimum | -4285 |
---|---|
5-th percentile | -2725.1 |
Q1 | -1769 |
Median | -874 |
Q3 | -379 |
95-th percentile | 0 |
Maximum | 0 |
Range | 4285 |
Interquartile range | 1390 |
Descriptive statistics
Standard deviation | 878.2755674 |
---|---|
Coef of variation | -0.8076605162 |
Kurtosis | -0.3454891625 |
Mean | -1087.4316 |
MAD | 739.867315 |
Skewness | -0.6733289404 |
Sum | -10874316 |
Variance | 771367.9723 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 1170 | 11.7% | |
-1 | 34 | 0.3% | |
-1799 | 15 | 0.1% | |
-2 | 14 | 0.1% | |
-3 | 13 | 0.1% | |
-1765 | 13 | 0.1% | |
-1800 | 12 | 0.1% | |
-1783 | 12 | 0.1% | |
-1776 | 12 | 0.1% | |
-513 | 12 | 0.1% | |
Other values (2814) | 8693 | 86.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
-4285 | 1 | < 0.1% | |
-4257 | 1 | < 0.1% | |
-4250 | 1 | < 0.1% | |
-4151 | 1 | < 0.1% | |
-4103 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0 | 1170 | 11.7% | |
-1 | 34 | 0.3% | |
-2 | 14 | 0.1% | |
-3 | 13 | 0.1% | |
-4 | 11 | 0.1% |
DAYS_REGISTRATION
Numeric
Distinct count | 6691 |
---|---|
Unique (%) | 66.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | -5008.8966 |
---|---|
Minimum | -18747 |
Maximum | -1 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | -18747 |
---|---|
5-th percentile | -11512.2 |
Q1 | -7541 |
Median | -4509.5 |
Q3 | -1930.75 |
95-th percentile | -363.95 |
Maximum | -1 |
Range | 18746 |
Interquartile range | 5610.25 |
Descriptive statistics
Standard deviation | 3569.986356 |
---|---|
Coef of variation | -0.712729098 |
Kurtosis | -0.3460433651 |
Mean | -5008.8966 |
MAD | 2959.435805 |
Skewness | -0.6085388408 |
Sum | -50088966 |
Variance | 12744802.58 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
-427 | 6 | 0.1% | |
-1051 | 6 | 0.1% | |
-756 | 6 | 0.1% | |
-1009 | 6 | 0.1% | |
-5312 | 6 | 0.1% | |
-4011 | 6 | 0.1% | |
-1045 | 6 | 0.1% | |
-83 | 6 | 0.1% | |
-5839 | 6 | 0.1% | |
-2902 | 6 | 0.1% | |
Other values (6681) | 9940 | 99.4% |
Minimum 5 values
Value | Count | Frequency (%) | |
-18747 | 1 | < 0.1% | |
-18590 | 1 | < 0.1% | |
-17568 | 2 | < 0.1% | |
-17450 | 1 | < 0.1% | |
-17358 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
-1 | 3 | < 0.1% | |
-2 | 2 | < 0.1% | |
-3 | 2 | < 0.1% | |
-4 | 1 | < 0.1% | |
-5 | 1 | < 0.1% |
DEF_30_CNT_SOCIAL_CIRCLE
Numeric
Distinct count | 6 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.1% |
Missing (n) | 7 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.1364955469 |
---|---|
Minimum | 0 |
Maximum | 4 |
Zeros (%) | 88.9% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 0 |
95-th percentile | 1 |
Maximum | 4 |
Range | 4 |
Interquartile range | 0 |
Descriptive statistics
Standard deviation | 0.4281689741 |
---|---|
Coef of variation | 3.136871377 |
Kurtosis | 18.13073275 |
Mean | 0.1364955469 |
MAD | 0.2428864019 |
Skewness | 3.833958826 |
Sum | 1364 |
Variance | 0.1833286703 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 8891 | 88.9% | |
1 | 894 | 8.9% | |
2 | 165 | 1.7% | |
3 | 32 | 0.3% | |
4 | 11 | 0.1% | |
(Missing) | 7 | 0.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 8891 | 88.9% | |
1 | 894 | 8.9% | |
2 | 165 | 1.7% | |
3 | 32 | 0.3% | |
4 | 11 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
4 | 11 | 0.1% | |
3 | 32 | 0.3% | |
2 | 165 | 1.7% | |
1 | 894 | 8.9% | |
0 | 8891 | 88.9% |
DEF_60_CNT_SOCIAL_CIRCLE
Numeric
Distinct count | 6 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.1% |
Missing (n) | 7 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.09476633644 |
---|---|
Minimum | 0 |
Maximum | 4 |
Zeros (%) | 91.8% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 0 |
95-th percentile | 1 |
Maximum | 4 |
Range | 4 |
Interquartile range | 0 |
Descriptive statistics
Standard deviation | 0.3447037634 |
---|---|
Coef of variation | 3.637407294 |
Kurtosis | 22.40205706 |
Mean | 0.09476633644 |
MAD | 0.1742077054 |
Skewness | 4.298849848 |
Sum | 947 |
Variance | 0.1188206845 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 9185 | 91.8% | |
1 | 693 | 6.9% | |
2 | 93 | 0.9% | |
3 | 20 | 0.2% | |
4 | 2 | < 0.1% | |
(Missing) | 7 | 0.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 9185 | 91.8% | |
1 | 693 | 6.9% | |
2 | 93 | 0.9% | |
3 | 20 | 0.2% | |
4 | 2 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
4 | 2 | < 0.1% | |
3 | 20 | 0.2% | |
2 | 93 | 0.9% | |
1 | 693 | 6.9% | |
0 | 9185 | 91.8% |
ELEVATORS_AVG
Numeric
Distinct count | 112 |
---|---|
Unique (%) | 1.1% |
Missing (%) | 51.7% |
Missing (n) | 5172 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.08456197183 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 28.1% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 0.16 |
95-th percentile | 0.37762 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.16 |
Descriptive statistics
Standard deviation | 0.1384101065 |
---|---|
Coef of variation | 1.636789014 |
Kurtosis | 5.885136441 |
Mean | 0.08456197183 |
MAD | 0.1028852235 |
Skewness | 2.199040584 |
Sum | 408.2652 |
Variance | 0.01915735758 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 2806 | 28.1% | |
0.16 | 337 | 3.4% | |
0.08 | 332 | 3.3% | |
0.24 | 208 | 2.1% | |
0.12 | 181 | 1.8% | |
0.2 | 139 | 1.4% | |
0.04 | 139 | 1.4% | |
0.32 | 101 | 1.0% | |
0.4 | 78 | 0.8% | |
0.28 | 75 | 0.8% | |
Other values (101) | 432 | 4.3% | |
(Missing) | 5172 | 51.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 2806 | 28.1% | |
0.0064 | 3 | < 0.1% | |
0.008 | 1 | < 0.1% | |
0.0132 | 1 | < 0.1% | |
0.016 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 2 | < 0.1% | |
0.96 | 1 | < 0.1% | |
0.92 | 1 | < 0.1% | |
0.88 | 3 | < 0.1% | |
0.84 | 2 | < 0.1% |
ELEVATORS_MEDI
Highly correlated
This variable is highly correlated with ELEVATORS_AVG
and should be ignored for analysis
Correlation | 0.9956755484 |
---|
ELEVATORS_MODE
Highly correlated
This variable is highly correlated with ELEVATORS_MEDI
and should be ignored for analysis
Correlation | 0.9866157093 |
---|
EMERGENCYSTATE_MODE
Boolean
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 45.3% |
Missing (n) | 4527 |
No | |
---|---|
Yes | 91 |
(Missing) |
Value | Count | Frequency (%) | |
No | 5382 | 53.8% | |
Yes | 91 | 0.9% | |
(Missing) | 4527 | 45.3% |
ENTRANCES_AVG
Numeric
Distinct count | 116 |
---|---|
Unique (%) | 1.2% |
Missing (%) | 48.3% |
Missing (n) | 4830 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.1528465957 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 0.1% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.0345 |
Q1 | 0.0917 |
Median | 0.1379 |
Q3 | 0.2069 |
95-th percentile | 0.3276 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.1152 |
Descriptive statistics
Standard deviation | 0.09781239653 |
---|---|
Coef of variation | 0.6399383385 |
Kurtosis | 10.53739394 |
Mean | 0.1528465957 |
MAD | 0.06954304111 |
Skewness | 2.231446944 |
Sum | 790.2169 |
Variance | 0.009567264915 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.1379 | 1178 | 11.8% | |
0.069 | 708 | 7.1% | |
0.2069 | 703 | 7.0% | |
0.1034 | 666 | 6.7% | |
0.0345 | 475 | 4.8% | |
0.1724 | 316 | 3.2% | |
0.2759 | 263 | 2.6% | |
0.2414 | 143 | 1.4% | |
0.3448 | 102 | 1.0% | |
0.3103 | 70 | 0.7% | |
Other values (105) | 546 | 5.5% | |
(Missing) | 4830 | 48.3% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 12 | 0.1% | |
0.0345 | 475 | 4.8% | |
0.0414 | 2 | < 0.1% | |
0.0459 | 1 | < 0.1% | |
0.0483 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 4 | < 0.1% | |
0.931 | 1 | < 0.1% | |
0.8966 | 2 | < 0.1% | |
0.8276 | 2 | < 0.1% | |
0.7931 | 1 | < 0.1% |
ENTRANCES_MEDI
Highly correlated
This variable is highly correlated with ENTRANCES_AVG
and should be ignored for analysis
Correlation | 0.9968285611 |
---|
ENTRANCES_MODE
Highly correlated
This variable is highly correlated with ENTRANCES_MEDI
and should be ignored for analysis
Correlation | 0.9800705269 |
---|
EXT_SOURCE_1
Numeric
Distinct count | 5775 |
---|---|
Unique (%) | 57.8% |
Missing (%) | 41.7% |
Missing (n) | 4171 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.5033719905 |
---|---|
Minimum | 0.01601914037 |
Maximum | 0.9288637886 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 0.01601914037 |
---|---|
5-th percentile | 0.1553925017 |
Q1 | 0.3458151757 |
Median | 0.508432704 |
Q3 | 0.6685842846 |
95-th percentile | 0.8222621292 |
Maximum | 0.9288637886 |
Range | 0.9128446482 |
Interquartile range | 0.3227691089 |
Descriptive statistics
Standard deviation | 0.205239556 |
---|---|
Coef of variation | 0.4077293928 |
Kurtosis | -0.867250267 |
Mean | 0.5033719905 |
MAD | 0.172350254 |
Skewness | -0.1204380734 |
Sum | 2934.155333 |
Variance | 0.04212327536 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.6179579067 | 2 | < 0.1% | |
0.2836959651 | 2 | < 0.1% | |
0.6124902241 | 2 | < 0.1% | |
0.3754127288 | 2 | < 0.1% | |
0.7505349625 | 2 | < 0.1% | |
0.4386229734 | 2 | < 0.1% | |
0.582051633 | 2 | < 0.1% | |
0.5007563873 | 2 | < 0.1% | |
0.5090070974 | 2 | < 0.1% | |
0.7425066257 | 2 | < 0.1% | |
Other values (5764) | 5809 | 58.1% | |
(Missing) | 4171 | 41.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
0.01601914037 | 1 | < 0.1% | |
0.02156946545 | 1 | < 0.1% | |
0.02826437195 | 1 | < 0.1% | |
0.02903901299 | 1 | < 0.1% | |
0.03045444096 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0.9288637886 | 1 | < 0.1% | |
0.9248527537 | 1 | < 0.1% | |
0.9228240409 | 1 | < 0.1% | |
0.9200405934 | 1 | < 0.1% | |
0.9187540813 | 1 | < 0.1% |
EXT_SOURCE_2
Numeric
Distinct count | 9327 |
---|---|
Unique (%) | 93.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.5150579106 |
---|---|
Minimum | 8.097855876e-06 |
Maximum | 0.8128863176 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 8.097855876e-06 |
---|---|
5-th percentile | 0.1488341288 |
Q1 | 0.4039053173 |
Median | 0.5567594897 |
Q3 | 0.6582363347 |
95-th percentile | 0.7477889978 |
Maximum | 0.8128863176 |
Range | 0.8128782198 |
Interquartile range | 0.2543310174 |
Descriptive statistics
Standard deviation | 0.1837955653 |
---|---|
Coef of variation | 0.3568444665 |
Kurtosis | -0.1594598234 |
Mean | 0.5150579106 |
MAD | 0.1497998378 |
Skewness | -0.7725568333 |
Sum | 5150.579106 |
Variance | 0.03378080984 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.2858978721 | 20 | 0.2% | |
0.2518770397 | 18 | 0.2% | |
0.5637630974 | 16 | 0.2% | |
0.1512429506 | 12 | 0.1% | |
0.5716197626 | 11 | 0.1% | |
0.5049820605 | 10 | 0.1% | |
0.1504111262 | 8 | 0.1% | |
0.3435402316 | 8 | 0.1% | |
0.1822756515 | 8 | 0.1% | |
0.5571533945 | 6 | 0.1% | |
Other values (9317) | 9883 | 98.8% |
Minimum 5 values
Value | Count | Frequency (%) | |
8.097855876e-06 | 1 | < 0.1% | |
0.0008046855282 | 1 | < 0.1% | |
0.00104415259 | 1 | < 0.1% | |
0.001064976875 | 1 | < 0.1% | |
0.00115555594 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0.8128863176 | 1 | < 0.1% | |
0.8108086695 | 1 | < 0.1% | |
0.8107516374 | 1 | < 0.1% | |
0.8087877918 | 1 | < 0.1% | |
0.8065253627 | 1 | < 0.1% |
EXT_SOURCE_3
Numeric
Distinct count | 613 |
---|---|
Unique (%) | 6.1% |
Missing (%) | 18.1% |
Missing (n) | 1813 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.4988269926 |
---|---|
Minimum | 0.0005272652387 |
Maximum | 0.8802684804 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 0.0005272652387 |
---|---|
5-th percentile | 0.1585548998 |
Q1 | 0.362277247 |
Median | 0.5172965814 |
Q3 | 0.652896552 |
95-th percentile | 0.7738956942 |
Maximum | 0.8802684804 |
Range | 0.8797412151 |
Interquartile range | 0.2906193049 |
Descriptive statistics
Standard deviation | 0.1893013231 |
---|---|
Coef of variation | 0.3794929423 |
Kurtosis | -0.7377331193 |
Mean | 0.4988269926 |
MAD | 0.1585421394 |
Skewness | -0.3325049161 |
Sum | 4083.896588 |
Variance | 0.03583499094 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.7136313997 | 55 | 0.5% | |
0.6706517531 | 54 | 0.5% | |
0.5585066277 | 44 | 0.4% | |
0.6940926425 | 44 | 0.4% | |
0.5797274228 | 43 | 0.4% | |
0.6832688314 | 43 | 0.4% | |
0.6263042767 | 41 | 0.4% | |
0.5388627066 | 40 | 0.4% | |
0.7463002131 | 40 | 0.4% | |
0.7062051097 | 40 | 0.4% | |
Other values (602) | 7743 | 77.4% | |
(Missing) | 1813 | 18.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
0.0005272652387 | 14 | 0.1% | |
0.02474394007 | 1 | < 0.1% | |
0.02654542968 | 1 | < 0.1% | |
0.03075308117 | 1 | < 0.1% | |
0.03635374366 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0.8802684804 | 1 | < 0.1% | |
0.8771942582 | 2 | < 0.1% | |
0.8724558162 | 1 | < 0.1% | |
0.8633633824 | 1 | < 0.1% | |
0.859924176 | 1 | < 0.1% |
FLAG_CONT_MOBILE
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
1 | |
---|---|
0 | 19 |
Value | Count | Frequency (%) | |
1 | 9981 | 99.8% | |
0 | 19 | 0.2% |
FLAG_DOCUMENT_10
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_11
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 13 |
Value | Count | Frequency (%) | |
0 | 9987 | 99.9% | |
1 | 13 | 0.1% |
FLAG_DOCUMENT_12
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_13
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_14
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_15
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_16
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_17
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_18
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 11 |
Value | Count | Frequency (%) | |
0 | 9989 | 99.9% | |
1 | 11 | 0.1% |
FLAG_DOCUMENT_19
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_2
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_20
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_21
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_3
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
1 | |
---|---|
0 |
Value | Count | Frequency (%) | |
1 | 7866 | 78.7% | |
0 | 2134 | 21.3% |
FLAG_DOCUMENT_4
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 1 |
Value | Count | Frequency (%) | |
0 | 9999 | > 99.9% | |
1 | 1 | < 0.1% |
FLAG_DOCUMENT_5
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 146 |
Value | Count | Frequency (%) | |
0 | 9854 | 98.5% | |
1 | 146 | 1.5% |
FLAG_DOCUMENT_6
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 885 |
Value | Count | Frequency (%) | |
0 | 9115 | 91.1% | |
1 | 885 | 8.8% |
FLAG_DOCUMENT_7
Constant
This variable is constant and should be ignored for analysis
Constant value | 0 |
---|
FLAG_DOCUMENT_8
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 863 |
Value | Count | Frequency (%) | |
0 | 9137 | 91.4% | |
1 | 863 | 8.6% |
FLAG_DOCUMENT_9
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 49 |
Value | Count | Frequency (%) | |
0 | 9951 | 99.5% | |
1 | 49 | 0.5% |
FLAG_EMAIL
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 1639 |
Value | Count | Frequency (%) | |
0 | 8361 | 83.6% | |
1 | 1639 | 16.4% |
FLAG_EMP_PHONE
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
1 | |
---|---|
0 |
Value | Count | Frequency (%) | |
1 | 8129 | 81.3% | |
0 | 1871 | 18.7% |
FLAG_MOBIL
Constant
This variable is constant and should be ignored for analysis
Constant value | 1 |
---|
FLAG_OWN_CAR
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
N | |
---|---|
Y |
Value | Count | Frequency (%) | |
N | 6702 | 67.0% | |
Y | 3298 | 33.0% |
FLAG_OWN_REALTY
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Y | |
---|---|
N |
Value | Count | Frequency (%) | |
Y | 6899 | 69.0% | |
N | 3101 | 31.0% |
FLAG_PHONE
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 |
Value | Count | Frequency (%) | |
0 | 7336 | 73.4% | |
1 | 2664 | 26.6% |
FLAG_WORK_PHONE
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 |
Value | Count | Frequency (%) | |
0 | 7914 | 79.1% | |
1 | 2086 | 20.9% |
FLOORSMAX_AVG
Numeric
Distinct count | 119 |
---|---|
Unique (%) | 1.2% |
Missing (%) | 47.7% |
Missing (n) | 4770 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.2336585086 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 0.9% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.0417 |
Q1 | 0.1667 |
Median | 0.1667 |
Q3 | 0.3333 |
95-th percentile | 0.5417 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.1666 |
Descriptive statistics
Standard deviation | 0.1468518884 |
---|---|
Coef of variation | 0.6284893681 |
Kurtosis | 2.708651431 |
Mean | 0.2336585086 |
MAD | 0.117822811 |
Skewness | 1.296188806 |
Sum | 1222.034 |
Variance | 0.02156547713 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.1667 | 2171 | 21.7% | |
0.3333 | 1107 | 11.1% | |
0.0417 | 416 | 4.2% | |
0.375 | 293 | 2.9% | |
0.125 | 216 | 2.2% | |
0.0833 | 195 | 1.9% | |
0 | 90 | 0.9% | |
0.4583 | 85 | 0.9% | |
0.6667 | 67 | 0.7% | |
0.625 | 66 | 0.7% | |
Other values (108) | 524 | 5.2% | |
(Missing) | 4770 | 47.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 90 | 0.9% | |
0.0083 | 1 | < 0.1% | |
0.0208 | 2 | < 0.1% | |
0.0312 | 1 | < 0.1% | |
0.0417 | 416 | 4.2% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 9 | 0.1% | |
0.9583 | 4 | < 0.1% | |
0.9167 | 1 | < 0.1% | |
0.875 | 8 | 0.1% | |
0.8333 | 1 | < 0.1% |
FLOORSMAX_MEDI
Highly correlated
This variable is highly correlated with FLOORSMAX_AVG
and should be ignored for analysis
Correlation | 0.9954940358 |
---|
FLOORSMAX_MODE
Highly correlated
This variable is highly correlated with FLOORSMAX_MEDI
and should be ignored for analysis
Correlation | 0.9904378273 |
---|
FLOORSMIN_AVG
Numeric
Distinct count | 102 |
---|---|
Unique (%) | 1.0% |
Missing (%) | 67.0% |
Missing (n) | 6702 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.2397194663 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 0.8% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.0417 |
Q1 | 0.125 |
Median | 0.2083 |
Q3 | 0.375 |
95-th percentile | 0.5417 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.25 |
Descriptive statistics
Standard deviation | 0.1620630482 |
---|---|
Coef of variation | 0.6760529323 |
Kurtosis | 1.458476843 |
Mean | 0.2397194663 |
MAD | 0.1254815042 |
Skewness | 0.960930656 |
Sum | 790.5948 |
Variance | 0.02626443158 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.2083 | 1169 | 11.7% | |
0.375 | 609 | 6.1% | |
0.0417 | 535 | 5.3% | |
0.0833 | 153 | 1.5% | |
0.4167 | 135 | 1.4% | |
0.1667 | 98 | 1.0% | |
0.125 | 94 | 0.9% | |
0 | 77 | 0.8% | |
0.5 | 49 | 0.5% | |
0.7083 | 43 | 0.4% | |
Other values (91) | 336 | 3.4% | |
(Missing) | 6702 | 67.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 77 | 0.8% | |
0.0312 | 1 | < 0.1% | |
0.0333 | 1 | < 0.1% | |
0.0417 | 535 | 5.3% | |
0.0625 | 2 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 5 | 0.1% | |
0.9583 | 1 | < 0.1% | |
0.9167 | 6 | 0.1% | |
0.8054 | 1 | < 0.1% | |
0.7917 | 2 | < 0.1% |
FLOORSMIN_MEDI
Highly correlated
This variable is highly correlated with FLOORSMIN_AVG
and should be ignored for analysis
Correlation | 0.9971154706 |
---|
FLOORSMIN_MODE
Highly correlated
This variable is highly correlated with FLOORSMIN_MEDI
and should be ignored for analysis
Correlation | 0.988324407 |
---|
FONDKAPREMONT_MODE
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 67.3% |
Missing (n) | 6732 |
reg oper account | |
---|---|
reg oper spec account | 399 |
not specified | 197 |
(Missing) |
Value | Count | Frequency (%) | |
reg oper account | 2507 | 25.1% | |
reg oper spec account | 399 | 4.0% | |
not specified | 197 | 2.0% | |
org spec account | 165 | 1.7% | |
(Missing) | 6732 | 67.3% |
Max length | 21 |
---|---|
Mean length | 7.3888 |
Min length | 3 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
HOUR_APPR_PROCESS_START
Numeric
Distinct count | 24 |
---|---|
Unique (%) | 0.2% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 12.0023 |
---|---|
Minimum | 0 |
Maximum | 23 |
Zeros (%) | < 0.1% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 7 |
Q1 | 10 |
Median | 12 |
Q3 | 14 |
95-th percentile | 17 |
Maximum | 23 |
Range | 23 |
Interquartile range | 4 |
Descriptive statistics
Standard deviation | 3.278897654 |
---|---|
Coef of variation | 0.2731891099 |
Kurtosis | -0.1160833048 |
Mean | 12.0023 |
MAD | 2.62207582 |
Skewness | 0.01939548735 |
Sum | 120023 |
Variance | 10.75116983 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
10 | 1329 | 13.3% | |
11 | 1200 | 12.0% | |
12 | 1160 | 11.6% | |
13 | 957 | 9.6% | |
9 | 891 | 8.9% | |
14 | 873 | 8.7% | |
15 | 750 | 7.5% | |
16 | 653 | 6.5% | |
8 | 521 | 5.2% | |
17 | 470 | 4.7% | |
Other values (14) | 1196 | 12.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 3 | < 0.1% | |
1 | 3 | < 0.1% | |
2 | 7 | 0.1% | |
3 | 55 | 0.5% | |
4 | 61 | 0.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
23 | 2 | < 0.1% | |
22 | 5 | 0.1% | |
21 | 15 | 0.1% | |
20 | 56 | 0.6% | |
19 | 138 | 1.4% |
HOUSETYPE_MODE
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 48.3% |
Missing (n) | 4826 |
block of flats | |
---|---|
specific housing | 54 |
terraced house | 42 |
(Missing) |
Value | Count | Frequency (%) | |
block of flats | 5078 | 50.8% | |
specific housing | 54 | 0.5% | |
terraced house | 42 | 0.4% | |
(Missing) | 4826 | 48.3% |
Max length | 16 |
---|---|
Mean length | 8.7022 |
Min length | 3 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
LANDAREA_AVG
Numeric
Distinct count | 1517 |
---|---|
Unique (%) | 15.2% |
Missing (%) | 57.9% |
Missing (n) | 5792 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.06689510456 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 5.3% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.0196 |
Median | 0.0491 |
Q3 | 0.0867 |
95-th percentile | 0.191225 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.0671 |
Descriptive statistics
Standard deviation | 0.07895724279 |
---|---|
Coef of variation | 1.180314214 |
Kurtosis | 32.53946878 |
Mean | 0.06689510456 |
MAD | 0.04898799979 |
Skewness | 4.202332546 |
Sum | 281.4946 |
Variance | 0.006234246189 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 531 | 5.3% | |
0.02 | 10 | 0.1% | |
0.0564 | 10 | 0.1% | |
0.0319 | 9 | 0.1% | |
0.0292 | 9 | 0.1% | |
0.0221 | 9 | 0.1% | |
0.0638 | 9 | 0.1% | |
0.0142 | 9 | 0.1% | |
0.0486 | 9 | 0.1% | |
0.0611 | 8 | 0.1% | |
Other values (1506) | 3595 | 35.9% | |
(Missing) | 5792 | 57.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 531 | 5.3% | |
0.0004 | 1 | < 0.1% | |
0.0006 | 1 | < 0.1% | |
0.0009 | 1 | < 0.1% | |
0.0011 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 5 | 0.1% | |
0.8638 | 1 | < 0.1% | |
0.7067 | 1 | < 0.1% | |
0.6767 | 1 | < 0.1% | |
0.6043 | 1 | < 0.1% |
LANDAREA_MEDI
Highly correlated
This variable is highly correlated with LANDAREA_AVG
and should be ignored for analysis
Correlation | 0.9803449477 |
---|
LANDAREA_MODE
Highly correlated
This variable is highly correlated with LANDAREA_MEDI
and should be ignored for analysis
Correlation | 0.9902680947 |
---|
LIVE_CITY_NOT_WORK_CITY
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 |
Value | Count | Frequency (%) | |
0 | 8289 | 82.9% | |
1 | 1711 | 17.1% |
LIVE_REGION_NOT_WORK_REGION
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 414 |
Value | Count | Frequency (%) | |
0 | 9586 | 95.9% | |
1 | 414 | 4.1% |
LIVINGAPARTMENTS_AVG
Highly correlated
This variable is highly correlated with APARTMENTS_MODE
and should be ignored for analysis
Correlation | 0.941832096 |
---|
LIVINGAPARTMENTS_MEDI
Highly correlated
This variable is highly correlated with LIVINGAPARTMENTS_AVG
and should be ignored for analysis
Correlation | 0.9973914912 |
---|
LIVINGAPARTMENTS_MODE
Highly correlated
This variable is highly correlated with LIVINGAPARTMENTS_MEDI
and should be ignored for analysis
Correlation | 0.9700264425 |
---|
LIVINGAREA_AVG
Highly correlated
This variable is highly correlated with LIVINGAPARTMENTS_MEDI
and should be ignored for analysis
Correlation | 0.9004271393 |
---|
LIVINGAREA_MEDI
Highly correlated
This variable is highly correlated with LIVINGAREA_AVG
and should be ignored for analysis
Correlation | 0.9966418413 |
---|
LIVINGAREA_MODE
Highly correlated
This variable is highly correlated with LIVINGAREA_MEDI
and should be ignored for analysis
Correlation | 0.9719932992 |
---|
NAME_CONTRACT_TYPE
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Cash loans | |
---|---|
Revolving loans | 97 |
Value | Count | Frequency (%) | |
Cash loans | 9903 | 99.0% | |
Revolving loans | 97 | 1.0% |
Max length | 15 |
---|---|
Mean length | 10.0485 |
Min length | 10 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
NAME_EDUCATION_TYPE
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Secondary / secondary special | |
---|---|
Higher education | |
Incomplete higher | 338 |
Other values (2) | 102 |
Value | Count | Frequency (%) | |
Secondary / secondary special | 6920 | 69.2% | |
Higher education | 2640 | 26.4% | |
Incomplete higher | 338 | 3.4% | |
Lower secondary | 94 | 0.9% | |
Academic degree | 8 | 0.1% |
Max length | 29 |
---|---|
Mean length | 25.0196 |
Min length | 15 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
NAME_FAMILY_STATUS
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Married | |
---|---|
Single / not married | |
Civil marriage | 829 |
Other values (2) | 1068 |
Value | Count | Frequency (%) | |
Married | 6655 | 66.5% | |
Single / not married | 1448 | 14.5% | |
Civil marriage | 829 | 8.3% | |
Separated | 647 | 6.5% | |
Widow | 421 | 4.2% |
Max length | 20 |
---|---|
Mean length | 9.5079 |
Min length | 5 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
NAME_HOUSING_TYPE
Categorical
Distinct count | 6 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
House / apartment | |
---|---|
With parents | 484 |
Municipal apartment | 348 |
Other values (3) | 237 |
Value | Count | Frequency (%) | |
House / apartment | 8931 | 89.3% | |
With parents | 484 | 4.8% | |
Municipal apartment | 348 | 3.5% | |
Rented apartment | 118 | 1.2% | |
Office apartment | 86 | 0.9% | |
Co-op apartment | 33 | 0.3% |
Max length | 19 |
---|---|
Mean length | 16.8006 |
Min length | 12 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
NAME_INCOME_TYPE
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Working | |
---|---|
Commercial associate | |
Pensioner | |
Other values (2) | 718 |
Value | Count | Frequency (%) | |
Working | 5141 | 51.4% | |
Commercial associate | 2270 | 22.7% | |
Pensioner | 1871 | 18.7% | |
State servant | 717 | 7.2% | |
Businessman | 1 | < 0.1% |
Max length | 20 |
---|---|
Mean length | 10.7558 |
Min length | 7 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
NAME_TYPE_SUITE
Categorical
Distinct count | 8 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 1.8% |
Missing (n) | 180 |
Unaccompanied | |
---|---|
Family | 1134 |
Spouse, partner | 308 |
Other values (4) | 166 |
(Missing) | 180 |
Value | Count | Frequency (%) | |
Unaccompanied | 8212 | 82.1% | |
Family | 1134 | 11.3% | |
Spouse, partner | 308 | 3.1% | |
Children | 92 | 0.9% | |
Other_B | 36 | 0.4% | |
Other_A | 24 | 0.2% | |
Group of people | 14 | 0.1% | |
(Missing) | 180 | 1.8% |
Max length | 15 |
---|---|
Mean length | 12.0086 |
Min length | 3 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
NONLIVINGAPARTMENTS_AVG
Numeric
Distinct count | 113 |
---|---|
Unique (%) | 1.1% |
Missing (%) | 68.6% |
Missing (n) | 6859 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.009763610315 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 18.0% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 0.0044 |
95-th percentile | 0.0309 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.0044 |
Descriptive statistics
Standard deviation | 0.05067394559 |
---|---|
Coef of variation | 5.190082762 |
Kurtosis | 222.0602677 |
Mean | 0.009763610315 |
MAD | 0.0137493301 |
Skewness | 13.6359402 |
Sum | 30.6675 |
Variance | 0.002567848761 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 1804 | 18.0% | |
0.0039 | 457 | 4.6% | |
0.0077 | 209 | 2.1% | |
0.0116 | 134 | 1.3% | |
0.0154 | 91 | 0.9% | |
0.0193 | 40 | 0.4% | |
0.0232 | 40 | 0.4% | |
0.0019 | 38 | 0.4% | |
0.0309 | 30 | 0.3% | |
0.027 | 20 | 0.2% | |
Other values (102) | 278 | 2.8% | |
(Missing) | 6859 | 68.6% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 1804 | 18.0% | |
0.0005 | 1 | < 0.1% | |
0.0007 | 1 | < 0.1% | |
0.0008 | 3 | < 0.1% | |
0.001 | 5 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 3 | < 0.1% | |
0.9344 | 1 | < 0.1% | |
0.695 | 1 | < 0.1% | |
0.668 | 2 | < 0.1% | |
0.6525 | 1 | < 0.1% |
NONLIVINGAPARTMENTS_MEDI
Highly correlated
This variable is highly correlated with NONLIVINGAPARTMENTS_AVG
and should be ignored for analysis
Correlation | 0.9355157411 |
---|
NONLIVINGAPARTMENTS_MODE
Highly correlated
This variable is highly correlated with NONLIVINGAPARTMENTS_MEDI
and should be ignored for analysis
Correlation | 0.9507902268 |
---|
NONLIVINGAREA_AVG
Numeric
Distinct count | 1062 |
---|---|
Unique (%) | 10.6% |
Missing (%) | 53.5% |
Missing (n) | 5354 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.03025959966 |
---|---|
Minimum | 0 |
Maximum | 1 |
Zeros (%) | 19.4% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0.0036 |
Q3 | 0.0283 |
95-th percentile | 0.13975 |
Maximum | 1 |
Range | 1 |
Interquartile range | 0.0283 |
Descriptive statistics
Standard deviation | 0.07449848764 |
---|---|
Coef of variation | 2.461978628 |
Kurtosis | 50.33140812 |
Mean | 0.03025959966 |
MAD | 0.03874996452 |
Skewness | 5.916302915 |
Sum | 140.5861 |
Variance | 0.00555002466 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 1936 | 19.4% | |
0.0012 | 21 | 0.2% | |
0.0023 | 20 | 0.2% | |
0.0011 | 19 | 0.2% | |
0.0018 | 19 | 0.2% | |
0.0021 | 18 | 0.2% | |
0.0045 | 17 | 0.2% | |
0.0024 | 16 | 0.2% | |
0.0039 | 16 | 0.2% | |
0.0022 | 16 | 0.2% | |
Other values (1051) | 2548 | 25.5% | |
(Missing) | 5354 | 53.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 1936 | 19.4% | |
0.0001 | 7 | 0.1% | |
0.0002 | 5 | 0.1% | |
0.0003 | 7 | 0.1% | |
0.0004 | 5 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1 | 4 | < 0.1% | |
0.8651 | 1 | < 0.1% | |
0.8431 | 1 | < 0.1% | |
0.7837 | 1 | < 0.1% | |
0.7461 | 1 | < 0.1% |
NONLIVINGAREA_MEDI
Highly correlated
This variable is highly correlated with NONLIVINGAREA_AVG
and should be ignored for analysis
Correlation | 0.994807585 |
---|
NONLIVINGAREA_MODE
Highly correlated
This variable is highly correlated with NONLIVINGAREA_MEDI
and should be ignored for analysis
Correlation | 0.988191842 |
---|
OBS_30_CNT_SOCIAL_CIRCLE
Numeric
Distinct count | 22 |
---|---|
Unique (%) | 0.2% |
Missing (%) | 0.1% |
Missing (n) | 7 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 1.425998199 |
---|---|
Minimum | 0 |
Maximum | 20 |
Zeros (%) | 53.7% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 2 |
95-th percentile | 6 |
Maximum | 20 |
Range | 20 |
Interquartile range | 2 |
Descriptive statistics
Standard deviation | 2.345112409 |
---|---|
Coef of variation | 1.644540933 |
Kurtosis | 9.018291802 |
Mean | 1.425998199 |
MAD | 1.662900746 |
Skewness | 2.582523177 |
Sum | 14250 |
Variance | 5.499552209 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0 | 5371 | 53.7% | |
1 | 1525 | 15.2% | |
2 | 1024 | 10.2% | |
3 | 674 | 6.7% | |
4 | 464 | 4.6% | |
5 | 286 | 2.9% | |
6 | 193 | 1.9% | |
7 | 136 | 1.4% | |
8 | 102 | 1.0% | |
9 | 71 | 0.7% | |
Other values (11) | 147 | 1.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 5371 | 53.7% | |
1 | 1525 | 15.2% | |
2 | 1024 | 10.2% | |
3 | 674 | 6.7% | |
4 | 464 | 4.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
20 | 4 | < 0.1% | |
19 | 2 | < 0.1% | |
18 | 3 | < 0.1% | |
17 | 4 | < 0.1% | |
16 | 5 | 0.1% |
OBS_60_CNT_SOCIAL_CIRCLE
Highly correlated
This variable is highly correlated with OBS_30_CNT_SOCIAL_CIRCLE
and should be ignored for analysis
Correlation | 0.9990456696 |
---|
OCCUPATION_TYPE
Categorical
Distinct count | 19 |
---|---|
Unique (%) | 0.2% |
Missing (%) | 31.6% |
Missing (n) | 3162 |
Laborers | |
---|---|
Sales staff | |
Core staff | |
Other values (15) | |
(Missing) |
Value | Count | Frequency (%) | |
Laborers | 1787 | 17.9% | |
Sales staff | 1067 | 10.7% | |
Core staff | 946 | 9.5% | |
Managers | 724 | 7.2% | |
Drivers | 545 | 5.5% | |
Accountants | 361 | 3.6% | |
High skill tech staff | 343 | 3.4% | |
Medicine staff | 265 | 2.6% | |
Cooking staff | 195 | 1.9% | |
Security staff | 183 | 1.8% | |
Other values (8) | 422 | 4.2% | |
(Missing) | 3162 | 31.6% |
Max length | 21 |
---|---|
Mean length | 8.1003 |
Min length | 3 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
ORGANIZATION_TYPE
Categorical
Distinct count | 57 |
---|---|
Unique (%) | 0.6% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Business Entity Type 3 | |
---|---|
XNA | |
Self-employed | |
Other values (54) |
Value | Count | Frequency (%) | |
Business Entity Type 3 | 2190 | 21.9% | |
XNA | 1871 | 18.7% | |
Self-employed | 1158 | 11.6% | |
Other | 575 | 5.8% | |
Medicine | 337 | 3.4% | |
Business Entity Type 2 | 332 | 3.3% | |
Government | 322 | 3.2% | |
Trade: type 7 | 291 | 2.9% | |
School | 270 | 2.7% | |
Kindergarten | 221 | 2.2% | |
Other values (47) | 2433 | 24.3% |
Max length | 22 |
---|---|
Mean length | 12.4132 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
OWN_CAR_AGE
Numeric
Distinct count | 47 |
---|---|
Unique (%) | 0.5% |
Missing (%) | 67.0% |
Missing (n) | 6702 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 11.73620376 |
---|---|
Minimum | 0 |
Maximum | 65 |
Zeros (%) | 0.2% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 5 |
Median | 9 |
Q3 | 15 |
95-th percentile | 27 |
Maximum | 65 |
Range | 65 |
Interquartile range | 10 |
Descriptive statistics
Standard deviation | 11.28596092 |
---|---|
Coef of variation | 0.9616364161 |
Kurtosis | 11.00097078 |
Mean | 11.73620376 |
MAD | 7.223059274 |
Skewness | 2.938736973 |
Sum | 38706 |
Variance | 127.3729139 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
4 | 260 | 2.6% | |
7 | 246 | 2.5% | |
8 | 213 | 2.1% | |
2 | 210 | 2.1% | |
3 | 207 | 2.1% | |
5 | 172 | 1.7% | |
15 | 161 | 1.6% | |
9 | 159 | 1.6% | |
10 | 152 | 1.5% | |
14 | 148 | 1.5% | |
Other values (36) | 1370 | 13.7% | |
(Missing) | 6702 | 67.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 18 | 0.2% | |
1 | 128 | 1.3% | |
2 | 210 | 2.1% | |
3 | 207 | 2.1% | |
4 | 260 | 2.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
65 | 89 | 0.9% | |
52 | 1 | < 0.1% | |
44 | 2 | < 0.1% | |
43 | 1 | < 0.1% | |
42 | 2 | < 0.1% |
REG_CITY_NOT_LIVE_CITY
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 752 |
Value | Count | Frequency (%) | |
0 | 9248 | 92.5% | |
1 | 752 | 7.5% |
REG_CITY_NOT_WORK_CITY
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 |
Value | Count | Frequency (%) | |
0 | 7775 | 77.8% | |
1 | 2225 | 22.2% |
REG_REGION_NOT_LIVE_REGION
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 175 |
Value | Count | Frequency (%) | |
0 | 9825 | 98.2% | |
1 | 175 | 1.8% |
REG_REGION_NOT_WORK_REGION
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 | 532 |
Value | Count | Frequency (%) | |
0 | 9468 | 94.7% | |
1 | 532 | 5.3% |
REGION_POPULATION_RELATIVE
Numeric
Distinct count | 80 |
---|---|
Unique (%) | 0.8% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.0212698105 |
---|---|
Minimum | 0.000253 |
Maximum | 0.072508 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 0.000253 |
---|---|
5-th percentile | 0.00496 |
Q1 | 0.010006 |
Median | 0.01885 |
Q3 | 0.028663 |
95-th percentile | 0.04622 |
Maximum | 0.072508 |
Range | 0.072255 |
Interquartile range | 0.018657 |
Descriptive statistics
Standard deviation | 0.01435711211 |
---|---|
Coef of variation | 0.674999531 |
Kurtosis | 2.917575498 |
Mean | 0.0212698105 |
MAD | 0.01072462164 |
Skewness | 1.465306823 |
Sum | 212.698105 |
Variance | 0.0002061266682 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.035792 | 574 | 5.7% | |
0.04622 | 518 | 5.2% | |
0.030755 | 415 | 4.2% | |
0.026392 | 382 | 3.8% | |
0.028663 | 368 | 3.7% | |
0.031329 | 315 | 3.1% | |
0.025164 | 312 | 3.1% | |
0.072508 | 310 | 3.1% | |
0.019101 | 301 | 3.0% | |
0.020713 | 280 | 2.8% | |
Other values (70) | 6225 | 62.3% |
Minimum 5 values
Value | Count | Frequency (%) | |
0.000253 | 1 | < 0.1% | |
0.000938 | 2 | < 0.1% | |
0.001276 | 10 | 0.1% | |
0.001333 | 7 | 0.1% | |
0.001417 | 14 | 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0.072508 | 310 | 3.1% | |
0.04622 | 518 | 5.2% | |
0.035792 | 574 | 5.7% | |
0.032561 | 216 | 2.2% | |
0.031329 | 315 | 3.1% |
REGION_RATING_CLIENT
Categorical
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
2 | |
---|---|
3 | |
1 | 1157 |
Value | Count | Frequency (%) | |
2 | 7212 | 72.1% | |
3 | 1631 | 16.3% | |
1 | 1157 | 11.6% |
Max length | 1 |
---|---|
Mean length | 1 |
Min length | 1 |
Contains chars | False |
Contains digits | True |
Contains spaces | False |
Contains non-words | False |
REGION_RATING_CLIENT_W_CITY
Highly correlated
This variable is highly correlated with REGION_RATING_CLIENT
and should be ignored for analysis
Correlation | 0.9323217636 |
---|
SK_ID_CURR
Numeric
Distinct count | 10000 |
---|---|
Unique (%) | 100.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 278712.356 |
---|---|
Minimum | 100013 |
Maximum | 456115 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 100013 |
---|---|
5-th percentile | 116340.55 |
Q1 | 190381.25 |
Median | 279269 |
Q3 | 367477.5 |
95-th percentile | 439071.3 |
Maximum | 456115 |
Range | 356102 |
Interquartile range | 177096.25 |
Descriptive statistics
Standard deviation | 102988.1352 |
---|---|
Coef of variation | 0.3695140636 |
Kurtosis | -1.193923752 |
Mean | 278712.356 |
MAD | 89156.02613 |
Skewness | -0.009692016528 |
Sum | 2787123560 |
Variance | 1.0606556e+10 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
346111 | 1 | < 0.1% | |
157903 | 1 | < 0.1% | |
215791 | 1 | < 0.1% | |
174677 | 1 | < 0.1% | |
241116 | 1 | < 0.1% | |
255515 | 1 | < 0.1% | |
228661 | 1 | < 0.1% | |
242996 | 1 | < 0.1% | |
183603 | 1 | < 0.1% | |
386350 | 1 | < 0.1% | |
Other values (9990) | 9990 | 99.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
100013 | 1 | < 0.1% | |
100028 | 1 | < 0.1% | |
100066 | 1 | < 0.1% | |
100067 | 1 | < 0.1% | |
100090 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
456115 | 1 | < 0.1% | |
456008 | 1 | < 0.1% | |
455955 | 1 | < 0.1% | |
455940 | 1 | < 0.1% | |
455907 | 1 | < 0.1% |
TOTALAREA_MODE
Highly correlated
This variable is highly correlated with LIVINGAREA_MODE
and should be ignored for analysis
Correlation | 0.9013424924 |
---|
WALLSMATERIAL_MODE
Categorical
Distinct count | 8 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 48.8% |
Missing (n) | 4882 |
Panel | |
---|---|
Stone, brick | |
Block | 287 |
Other values (4) | 363 |
(Missing) |
Value | Count | Frequency (%) | |
Panel | 2360 | 23.6% | |
Stone, brick | 2108 | 21.1% | |
Block | 287 | 2.9% | |
Wooden | 160 | 1.6% | |
Mixed | 82 | 0.8% | |
Monolithic | 63 | 0.6% | |
Others | 58 | 0.6% | |
(Missing) | 4882 | 48.8% |
Max length | 12 |
---|---|
Mean length | 5.5525 |
Min length | 3 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
WEEKDAY_APPR_PROCESS_START
Categorical
Distinct count | 7 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
TUESDAY | |
---|---|
WEDNESDAY | |
THURSDAY | |
Other values (4) |
Value | Count | Frequency (%) | |
TUESDAY | 2010 | 20.1% | |
WEDNESDAY | 1805 | 18.1% | |
THURSDAY | 1750 | 17.5% | |
MONDAY | 1610 | 16.1% | |
FRIDAY | 1460 | 14.6% | |
SATURDAY | 974 | 9.7% | |
SUNDAY | 391 | 3.9% |
Max length | 9 |
---|---|
Mean length | 7.2873 |
Min length | 6 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
YEARS_BEGINEXPLUATATION_AVG
Numeric
Distinct count | 111 |
---|---|
Unique (%) | 1.1% |
Missing (%) | 46.6% |
Missing (n) | 4656 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.9783374626 |
---|---|
Minimum | 0 |
Maximum | 0.9995 |
Zeros (%) | 0.2% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.9697 |
Q1 | 0.9767 |
Median | 0.9816 |
Q3 | 0.9866 |
95-th percentile | 0.996 |
Maximum | 0.9995 |
Range | 0.9995 |
Interquartile range | 0.0099 |
Descriptive statistics
Standard deviation | 0.05589494899 |
---|---|
Coef of variation | 0.05713258577 |
Kurtosis | 295.9636266 |
Mean | 0.9783374626 |
MAD | 0.009875058799 |
Skewness | -17.06666603 |
Sum | 5228.2354 |
Variance | 0.003124245323 |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
0.9871 | 149 | 1.5% | |
0.9866 | 143 | 1.4% | |
0.9826 | 143 | 1.4% | |
0.9856 | 142 | 1.4% | |
0.9816 | 142 | 1.4% | |
0.9811 | 138 | 1.4% | |
0.9861 | 137 | 1.4% | |
0.9846 | 137 | 1.4% | |
0.9791 | 137 | 1.4% | |
0.9806 | 136 | 1.4% | |
Other values (100) | 3940 | 39.4% | |
(Missing) | 4656 | 46.6% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 17 | 0.2% | |
0.9165 | 1 | < 0.1% | |
0.921 | 1 | < 0.1% | |
0.9225 | 1 | < 0.1% | |
0.9275 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
0.9995 | 20 | 0.2% | |
0.999 | 36 | 0.4% | |
0.9985 | 36 | 0.4% | |
0.998 | 41 | 0.4% | |
0.9975 | 41 | 0.4% |
YEARS_BEGINEXPLUATATION_MEDI
Highly correlated
This variable is highly correlated with YEARS_BEGINEXPLUATATION_AVG
and should be ignored for analysis
Correlation | 0.9999688217 |
---|
YEARS_BEGINEXPLUATATION_MODE
Highly correlated
This variable is highly correlated with YEARS_BEGINEXPLUATATION_MEDI
and should be ignored for analysis
Correlation | 0.999758927 |
---|
YEARS_BUILD_AVG
Highly correlated
This variable is highly correlated with YEARS_BEGINEXPLUATATION_MODE
and should be ignored for analysis
Correlation | 0.9612937056 |
---|
YEARS_BUILD_MEDI
Highly correlated
This variable is highly correlated with YEARS_BUILD_AVG
and should be ignored for analysis
Correlation | 0.9989002843 |
---|
YEARS_BUILD_MODE
Highly correlated
This variable is highly correlated with YEARS_BUILD_MEDI
and should be ignored for analysis
Correlation | 0.992116652 |
---|
First rows
AMT_ANNUITY | AMT_CREDIT | AMT_GOODS_PRICE | AMT_INCOME_TOTAL | AMT_REQ_CREDIT_BUREAU_DAY | AMT_REQ_CREDIT_BUREAU_HOUR | AMT_REQ_CREDIT_BUREAU_MON | AMT_REQ_CREDIT_BUREAU_QRT | AMT_REQ_CREDIT_BUREAU_WEEK | AMT_REQ_CREDIT_BUREAU_YEAR | APARTMENTS_AVG | APARTMENTS_MEDI | APARTMENTS_MODE | BASEMENTAREA_AVG | BASEMENTAREA_MEDI | BASEMENTAREA_MODE | CNT_CHILDREN | CNT_FAM_MEMBERS | CODE_GENDER | COMMONAREA_AVG | COMMONAREA_MEDI | COMMONAREA_MODE | DAYS_BIRTH | DAYS_EMPLOYED | DAYS_ID_PUBLISH | DAYS_LAST_PHONE_CHANGE | DAYS_REGISTRATION | DEF_30_CNT_SOCIAL_CIRCLE | DEF_60_CNT_SOCIAL_CIRCLE | ELEVATORS_AVG | ELEVATORS_MEDI | ELEVATORS_MODE | EMERGENCYSTATE_MODE | ENTRANCES_AVG | ENTRANCES_MEDI | ENTRANCES_MODE | EXT_SOURCE_1 | EXT_SOURCE_2 | EXT_SOURCE_3 | FLAG_CONT_MOBILE | FLAG_DOCUMENT_10 | FLAG_DOCUMENT_11 | FLAG_DOCUMENT_12 | FLAG_DOCUMENT_13 | FLAG_DOCUMENT_14 | FLAG_DOCUMENT_15 | FLAG_DOCUMENT_16 | FLAG_DOCUMENT_17 | FLAG_DOCUMENT_18 | FLAG_DOCUMENT_19 | FLAG_DOCUMENT_2 | FLAG_DOCUMENT_20 | FLAG_DOCUMENT_21 | FLAG_DOCUMENT_3 | FLAG_DOCUMENT_4 | FLAG_DOCUMENT_5 | FLAG_DOCUMENT_6 | FLAG_DOCUMENT_7 | FLAG_DOCUMENT_8 | FLAG_DOCUMENT_9 | FLAG_EMAIL | FLAG_EMP_PHONE | FLAG_MOBIL | FLAG_OWN_CAR | FLAG_OWN_REALTY | FLAG_PHONE | FLAG_WORK_PHONE | FLOORSMAX_AVG | FLOORSMAX_MEDI | FLOORSMAX_MODE | FLOORSMIN_AVG | FLOORSMIN_MEDI | FLOORSMIN_MODE | FONDKAPREMONT_MODE | HOUR_APPR_PROCESS_START | HOUSETYPE_MODE | LANDAREA_AVG | LANDAREA_MEDI | LANDAREA_MODE | LIVE_CITY_NOT_WORK_CITY | LIVE_REGION_NOT_WORK_REGION | LIVINGAPARTMENTS_AVG | LIVINGAPARTMENTS_MEDI | LIVINGAPARTMENTS_MODE | LIVINGAREA_AVG | LIVINGAREA_MEDI | LIVINGAREA_MODE | NAME_CONTRACT_TYPE | NAME_EDUCATION_TYPE | NAME_FAMILY_STATUS | NAME_HOUSING_TYPE | NAME_INCOME_TYPE | NAME_TYPE_SUITE | NONLIVINGAPARTMENTS_AVG | NONLIVINGAPARTMENTS_MEDI | NONLIVINGAPARTMENTS_MODE | NONLIVINGAREA_AVG | NONLIVINGAREA_MEDI | NONLIVINGAREA_MODE | OBS_30_CNT_SOCIAL_CIRCLE | OBS_60_CNT_SOCIAL_CIRCLE | OCCUPATION_TYPE | ORGANIZATION_TYPE | OWN_CAR_AGE | REG_CITY_NOT_LIVE_CITY | REG_CITY_NOT_WORK_CITY | REG_REGION_NOT_LIVE_REGION | REG_REGION_NOT_WORK_REGION | REGION_POPULATION_RELATIVE | REGION_RATING_CLIENT | REGION_RATING_CLIENT_W_CITY | SK_ID_CURR | TOTALAREA_MODE | WALLSMATERIAL_MODE | WEEKDAY_APPR_PROCESS_START | YEARS_BEGINEXPLUATATION_AVG | YEARS_BEGINEXPLUATATION_MEDI | YEARS_BEGINEXPLUATATION_MODE | YEARS_BUILD_AVG | YEARS_BUILD_MEDI | YEARS_BUILD_MODE | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 33610.5 | 547272.0 | 495000.0 | 135000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 0.0082 | 0.0083 | 0.0084 | 0.000 | 0.000 | 0.0000 | 0 | 2 | F | 0.0011 | 0.0011 | 0.0011 | -20433 | -947 | -1732 | -1169 | -205 | 1.0 | 1.0 | 0.00 | 0.00 | 0.0000 | No | 0.0690 | 0.0690 | 0.0690 | 0.664626 | 0.230055 | 0.504681 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | Y | 1 | 0 | 0.0417 | 0.0417 | 0.0417 | 0.0833 | 0.0833 | 0.0833 | reg oper account | 7 | block of flats | 0.0161 | 0.0163 | 0.0164 | 0 | 0 | 0.0067 | 0.0068 | 0.0073 | 0.0073 | 0.0075 | 0.0077 | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | 0.0 | 0.0 | 0.0 | 0.000 | 0.0000 | 0.0000 | 10.0 | 10.0 | Security staff | Security | NaN | 0 | 0 | 0 | 0 | 0.018801 | 2 | 2 | 324271 | 0.0064 | Wooden | SATURDAY | 0.9687 | 0.9687 | 0.9687 | 0.5716 | 0.5773 | 0.5884 |
1 | 24696.0 | 481176.0 | 360000.0 | 90000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0577 | 0.0583 | 0.0588 | 0.000 | 0.000 | 0.0000 | 0 | 2 | F | 0.0034 | 0.0034 | 0.0034 | -19475 | -3768 | -3010 | -192 | -3423 | 0.0 | 0.0 | 0.00 | 0.00 | 0.0000 | No | 0.0690 | 0.0690 | 0.0690 | NaN | 0.161084 | 0.215182 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | Y | Y | 0 | 0 | 0.1250 | 0.1250 | 0.1250 | 0.1667 | 0.1667 | 0.1667 | reg oper account | 14 | block of flats | 0.0000 | 0.0000 | 0.0000 | 0 | 0 | 0.0471 | 0.0479 | 0.0514 | 0.0259 | 0.0263 | 0.0270 | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | 0.0 | 0.0 | 0.0 | 0.000 | 0.0000 | 0.0000 | 0.0 | 0.0 | Laborers | Business Entity Type 1 | 4.0 | 0 | 0 | 0 | 0 | 0.019689 | 2 | 2 | 211635 | 0.0219 | Stone, brick | THURSDAY | 0.9727 | 0.9727 | 0.9727 | 0.6260 | 0.6310 | 0.6406 |
2 | 40428.0 | 383760.0 | 360000.0 | 157500.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 5.0 | NaN | NaN | NaN | NaN | NaN | NaN | 1 | 2 | F | NaN | NaN | NaN | -9679 | -325 | -2343 | -197 | -4444 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.423490 | 0.110020 | 0.397946 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | N | N | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 11 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Higher education | Separated | House / apartment | Working | Family | NaN | NaN | NaN | NaN | NaN | NaN | 7.0 | 7.0 | Core staff | Kindergarten | NaN | 1 | 1 | 1 | 1 | 0.007120 | 2 | 2 | 185314 | NaN | NaN | SATURDAY | NaN | NaN | NaN | NaN | NaN | NaN |
3 | 7218.0 | 67500.0 | 67500.0 | 76500.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0082 | 0.0083 | 0.0084 | NaN | NaN | NaN | 0 | 1 | F | NaN | NaN | NaN | -24215 | 365243 | -4481 | -317 | -11139 | 0.0 | 0.0 | 0.00 | 0.00 | 0.0000 | No | 0.0345 | 0.0345 | 0.0345 | NaN | 0.571534 | NaN | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | N | Y | 0 | 0 | 0.0417 | 0.0417 | 0.0417 | NaN | NaN | NaN | NaN | 13 | block of flats | 0.0045 | 0.0046 | 0.0046 | 0 | 0 | NaN | NaN | NaN | 0.0043 | 0.0044 | 0.0045 | Cash loans | Secondary / secondary special | Widow | House / apartment | Pensioner | Unaccompanied | NaN | NaN | NaN | 0.011 | 0.0112 | 0.0116 | 0.0 | 0.0 | NaN | XNA | NaN | 0 | 0 | 0 | 0 | 0.014520 | 2 | 2 | 233970 | 0.0058 | Block | WEDNESDAY | 0.9821 | 0.9821 | 0.9821 | NaN | NaN | NaN |
4 | 74416.5 | 1800000.0 | 1800000.0 | 139500.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 2 | F | NaN | NaN | NaN | -20701 | -3828 | -3933 | -3052 | -6860 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.773392 | 0.629777 | 0.694093 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | N | N | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 10 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Married | House / apartment | Commercial associate | Children | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | Core staff | Trade: type 7 | NaN | 0 | 0 | 0 | 0 | 0.035792 | 2 | 2 | 224300 | NaN | NaN | TUESDAY | NaN | NaN | NaN | NaN | NaN | NaN |
5 | 29493.0 | 817560.0 | 675000.0 | 94500.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 2 | F | NaN | NaN | NaN | -17327 | -3418 | -888 | -1767 | -9679 | 0.0 | 0.0 | NaN | NaN | NaN | Yes | 0.0690 | 0.0690 | 0.0690 | NaN | 0.535336 | 0.768808 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | N | 0 | 0 | 0.0417 | 0.0417 | 0.0417 | NaN | NaN | NaN | NaN | 20 | block of flats | 0.0168 | 0.0165 | 0.0161 | 1 | 0 | NaN | NaN | NaN | 0.0116 | 0.0118 | 0.0120 | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | Laborers | Business Entity Type 3 | NaN | 0 | 1 | 0 | 0 | 0.035792 | 2 | 2 | 204673 | 0.0099 | Wooden | SUNDAY | 0.9573 | 0.9573 | 0.9573 | NaN | NaN | NaN |
6 | 16830.0 | 140746.5 | 121500.0 | 112500.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | 1 | 3 | M | NaN | NaN | NaN | -8109 | -139 | -753 | -1422 | -8074 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.367065 | 0.054565 | 0.397946 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | Y | Y | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 12 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | 1.0 | Drivers | Industry: type 3 | 2.0 | 0 | 0 | 0 | 0 | 0.016612 | 2 | 2 | 180598 | NaN | NaN | SATURDAY | NaN | NaN | NaN | NaN | NaN | NaN |
7 | 62964.0 | 983160.0 | 900000.0 | 216000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | 0.2948 | 0.2977 | 0.3004 | 0.175 | 0.175 | 0.1816 | 1 | 2 | F | 0.0539 | 0.0543 | 0.0544 | -12283 | -427 | -4593 | -834 | -6245 | 0.0 | 0.0 | 0.32 | 0.32 | 0.3222 | No | 0.2759 | 0.2759 | 0.2759 | 0.295576 | 0.720200 | 0.424130 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | Y | 0 | 0 | 0.3333 | 0.3333 | 0.3333 | 0.3750 | 0.3750 | 0.3750 | reg oper account | 12 | block of flats | NaN | NaN | NaN | 0 | 0 | 0.2404 | 0.2446 | 0.2626 | 0.3073 | 0.3129 | 0.3202 | Cash loans | Higher education | Single / not married | House / apartment | Commercial associate | Family | 0.0 | 0.0 | 0.0 | 0.000 | 0.0000 | 0.0000 | 0.0 | 0.0 | Accountants | Business Entity Type 3 | NaN | 0 | 0 | 0 | 0 | 0.046220 | 1 | 1 | 297129 | 0.2436 | Panel | SATURDAY | 0.9806 | 0.9806 | 0.9806 | 0.7348 | 0.7383 | 0.7452 |
8 | 25996.5 | 334242.0 | 279000.0 | 202500.0 | 0.0 | 0.0 | 0.0 | 3.0 | 0.0 | 4.0 | NaN | NaN | NaN | NaN | NaN | NaN | 1 | 3 | M | NaN | NaN | NaN | -13197 | -2285 | -1929 | -1024 | -3690 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.202165 | 0.299000 | 0.481249 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | Y | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 15 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | Drivers | Self-employed | NaN | 0 | 0 | 0 | 0 | 0.002042 | 3 | 3 | 165624 | NaN | NaN | WEDNESDAY | NaN | NaN | NaN | NaN | NaN | NaN |
9 | 30451.5 | 543037.5 | 463500.0 | 157500.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 1 | F | NaN | NaN | NaN | -17467 | -4226 | -1017 | -1617 | -477 | 1.0 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.710845 | 0.577484 | 0.450747 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | Y | 1 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 13 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Incomplete higher | Widow | Office apartment | Commercial associate | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 5.0 | 5.0 | Managers | Self-employed | NaN | 0 | 0 | 0 | 0 | 0.003122 | 3 | 3 | 101728 | NaN | NaN | THURSDAY | NaN | NaN | NaN | NaN | NaN | NaN |
Last rows
AMT_ANNUITY | AMT_CREDIT | AMT_GOODS_PRICE | AMT_INCOME_TOTAL | AMT_REQ_CREDIT_BUREAU_DAY | AMT_REQ_CREDIT_BUREAU_HOUR | AMT_REQ_CREDIT_BUREAU_MON | AMT_REQ_CREDIT_BUREAU_QRT | AMT_REQ_CREDIT_BUREAU_WEEK | AMT_REQ_CREDIT_BUREAU_YEAR | APARTMENTS_AVG | APARTMENTS_MEDI | APARTMENTS_MODE | BASEMENTAREA_AVG | BASEMENTAREA_MEDI | BASEMENTAREA_MODE | CNT_CHILDREN | CNT_FAM_MEMBERS | CODE_GENDER | COMMONAREA_AVG | COMMONAREA_MEDI | COMMONAREA_MODE | DAYS_BIRTH | DAYS_EMPLOYED | DAYS_ID_PUBLISH | DAYS_LAST_PHONE_CHANGE | DAYS_REGISTRATION | DEF_30_CNT_SOCIAL_CIRCLE | DEF_60_CNT_SOCIAL_CIRCLE | ELEVATORS_AVG | ELEVATORS_MEDI | ELEVATORS_MODE | EMERGENCYSTATE_MODE | ENTRANCES_AVG | ENTRANCES_MEDI | ENTRANCES_MODE | EXT_SOURCE_1 | EXT_SOURCE_2 | EXT_SOURCE_3 | FLAG_CONT_MOBILE | FLAG_DOCUMENT_10 | FLAG_DOCUMENT_11 | FLAG_DOCUMENT_12 | FLAG_DOCUMENT_13 | FLAG_DOCUMENT_14 | FLAG_DOCUMENT_15 | FLAG_DOCUMENT_16 | FLAG_DOCUMENT_17 | FLAG_DOCUMENT_18 | FLAG_DOCUMENT_19 | FLAG_DOCUMENT_2 | FLAG_DOCUMENT_20 | FLAG_DOCUMENT_21 | FLAG_DOCUMENT_3 | FLAG_DOCUMENT_4 | FLAG_DOCUMENT_5 | FLAG_DOCUMENT_6 | FLAG_DOCUMENT_7 | FLAG_DOCUMENT_8 | FLAG_DOCUMENT_9 | FLAG_EMAIL | FLAG_EMP_PHONE | FLAG_MOBIL | FLAG_OWN_CAR | FLAG_OWN_REALTY | FLAG_PHONE | FLAG_WORK_PHONE | FLOORSMAX_AVG | FLOORSMAX_MEDI | FLOORSMAX_MODE | FLOORSMIN_AVG | FLOORSMIN_MEDI | FLOORSMIN_MODE | FONDKAPREMONT_MODE | HOUR_APPR_PROCESS_START | HOUSETYPE_MODE | LANDAREA_AVG | LANDAREA_MEDI | LANDAREA_MODE | LIVE_CITY_NOT_WORK_CITY | LIVE_REGION_NOT_WORK_REGION | LIVINGAPARTMENTS_AVG | LIVINGAPARTMENTS_MEDI | LIVINGAPARTMENTS_MODE | LIVINGAREA_AVG | LIVINGAREA_MEDI | LIVINGAREA_MODE | NAME_CONTRACT_TYPE | NAME_EDUCATION_TYPE | NAME_FAMILY_STATUS | NAME_HOUSING_TYPE | NAME_INCOME_TYPE | NAME_TYPE_SUITE | NONLIVINGAPARTMENTS_AVG | NONLIVINGAPARTMENTS_MEDI | NONLIVINGAPARTMENTS_MODE | NONLIVINGAREA_AVG | NONLIVINGAREA_MEDI | NONLIVINGAREA_MODE | OBS_30_CNT_SOCIAL_CIRCLE | OBS_60_CNT_SOCIAL_CIRCLE | OCCUPATION_TYPE | ORGANIZATION_TYPE | OWN_CAR_AGE | REG_CITY_NOT_LIVE_CITY | REG_CITY_NOT_WORK_CITY | REG_REGION_NOT_LIVE_REGION | REG_REGION_NOT_WORK_REGION | REGION_POPULATION_RELATIVE | REGION_RATING_CLIENT | REGION_RATING_CLIENT_W_CITY | SK_ID_CURR | TOTALAREA_MODE | WALLSMATERIAL_MODE | WEEKDAY_APPR_PROCESS_START | YEARS_BEGINEXPLUATATION_AVG | YEARS_BEGINEXPLUATATION_MEDI | YEARS_BEGINEXPLUATATION_MODE | YEARS_BUILD_AVG | YEARS_BUILD_MEDI | YEARS_BUILD_MODE | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
9990 | 17127.0 | 332946.0 | 238500.0 | 135000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0753 | 0.0760 | 0.0767 | 0.0411 | 0.0411 | 0.0427 | 0 | 2 | M | NaN | NaN | NaN | -12449 | -319 | -4474 | 0 | -6518 | 0.0 | 0.0 | NaN | NaN | NaN | No | 0.1379 | 0.1379 | 0.1379 | NaN | 0.182276 | 0.801601 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | N | Y | 0 | 0 | 0.1667 | 0.1667 | 0.1667 | NaN | NaN | NaN | NaN | 17 | block of flats | 0.0000 | 0.0000 | 0.0000 | 0 | 0 | 0.0614 | 0.0624 | 0.0670 | 0.0704 | 0.0716 | 0.0733 | Cash loans | Higher education | Civil marriage | House / apartment | Commercial associate | Unaccompanied | NaN | NaN | NaN | 0.0271 | 0.0277 | 0.0287 | 2.0 | 2.0 | Laborers | Business Entity Type 3 | NaN | 0 | 0 | 0 | 0 | 0.007120 | 2 | 2 | 150626 | 0.0553 | Stone, brick | SATURDAY | 0.9752 | 0.9752 | 0.9752 | 0.6600 | 0.6645 | 0.6733 |
9991 | 9000.0 | 180000.0 | 180000.0 | 94500.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 0.0619 | 0.0625 | 0.0630 | 0.0472 | 0.0472 | 0.0490 | 1 | 3 | F | 0.0398 | 0.0401 | 0.0402 | -17099 | -2744 | -643 | -1923 | -4646 | 1.0 | 1.0 | 0.00 | 0.00 | 0.0000 | No | 0.1034 | 0.1034 | 0.1034 | NaN | 0.114350 | 0.205598 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | Y | 1 | 1 | 0.1667 | 0.1667 | 0.1667 | 0.2083 | 0.2083 | 0.2083 | reg oper account | 7 | block of flats | 0.0422 | 0.0429 | 0.0431 | 0 | 0 | 0.0504 | 0.0513 | 0.0551 | 0.0488 | 0.0497 | 0.0508 | Revolving loans | Secondary / secondary special | Married | House / apartment | Commercial associate | Unaccompanied | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 1.0 | 1.0 | Laborers | Other | NaN | 0 | 0 | 0 | 0 | 0.014464 | 2 | 2 | 121135 | 0.0384 | Stone, brick | THURSDAY | 0.9752 | 0.9752 | 0.9752 | 0.6600 | 0.6645 | 0.6733 |
9992 | 17370.0 | 164182.5 | 148500.0 | 90000.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.1649 | 0.1665 | 0.0756 | 0.0715 | 0.0715 | 0.0429 | 0 | 2 | F | 0.0755 | 0.0760 | 0.0396 | -24824 | 365243 | -5431 | -2112 | -5431 | 0.0 | 0.0 | 0.18 | 0.18 | 0.0806 | No | 0.1552 | 0.1552 | 0.0690 | 0.809290 | 0.472400 | 0.783832 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | N | Y | 0 | 0 | 0.3333 | 0.3333 | 0.3333 | 0.3750 | 0.3750 | 0.3750 | reg oper account | 9 | block of flats | 0.0254 | 0.0258 | 0.0119 | 0 | 0 | 0.1311 | 0.1334 | 0.0661 | 0.1662 | 0.1692 | 0.0804 | Cash loans | Secondary / secondary special | Civil marriage | House / apartment | Pensioner | Unaccompanied | 0.0154 | 0.0155 | 0.0000 | 0.0135 | 0.0138 | 0.0000 | 11.0 | 11.0 | NaN | XNA | NaN | 0 | 0 | 0 | 0 | 0.025164 | 2 | 2 | 209420 | 0.2699 | Panel | MONDAY | 0.9866 | 0.9866 | 0.9866 | 0.8164 | 0.8189 | 0.8236 |
9993 | 25866.0 | 607500.0 | 607500.0 | 135000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | 1 | 3 | F | NaN | NaN | NaN | -9909 | -1227 | -2545 | 0 | -4161 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.200977 | 0.258582 | 0.661024 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | Y | Y | 1 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 7 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | Sales staff | Self-employed | 7.0 | 0 | 0 | 0 | 0 | 0.020246 | 3 | 3 | 263565 | NaN | NaN | FRIDAY | NaN | NaN | NaN | NaN | NaN | NaN |
9994 | 10237.5 | 198000.0 | 198000.0 | 135000.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 2.0 | 0.0928 | 0.0937 | 0.0945 | 0.0742 | 0.0742 | 0.0770 | 0 | 2 | F | 0.0118 | 0.0119 | 0.0120 | -21364 | 365243 | -4302 | -813 | -5426 | 0.0 | 0.0 | 0.00 | 0.00 | 0.0000 | No | 0.2069 | 0.2069 | 0.2069 | NaN | 0.457083 | 0.474051 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | N | Y | 0 | 0 | 0.1667 | 0.1667 | 0.1667 | 0.2083 | 0.2083 | 0.2083 | reg oper account | 12 | block of flats | 0.0579 | 0.0589 | 0.0592 | 0 | 0 | 0.0756 | 0.0770 | 0.0826 | 0.0778 | 0.0792 | 0.0810 | Cash loans | Higher education | Married | House / apartment | Pensioner | Unaccompanied | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0 | 0.0 | NaN | XNA | NaN | 0 | 0 | 0 | 0 | 0.018801 | 2 | 2 | 210157 | 0.0679 | Panel | WEDNESDAY | 0.9811 | 0.9811 | 0.9811 | 0.7416 | 0.7451 | 0.7517 |
9995 | 11308.5 | 232344.0 | 157500.0 | 99000.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 2 | M | NaN | NaN | NaN | -22054 | 365243 | -5352 | -557 | -5349 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.493130 | 0.332851 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | Y | Y | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 11 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Married | House / apartment | Pensioner | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | NaN | XNA | 10.0 | 0 | 0 | 0 | 0 | 0.005313 | 2 | 2 | 200031 | NaN | NaN | TUESDAY | NaN | NaN | NaN | NaN | NaN | NaN |
9996 | 23908.5 | 465133.5 | 388260.0 | 135000.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 2.0 | 0.1010 | 0.1020 | 0.1029 | 0.0954 | 0.0954 | 0.0990 | 0 | 2 | F | 0.0366 | 0.0368 | 0.0369 | -19589 | 365243 | -3101 | -1634 | -2667 | 0.0 | 0.0 | 0.00 | 0.00 | 0.0000 | No | 0.2069 | 0.2069 | 0.2069 | 0.735737 | 0.350328 | 0.232725 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | Y | Y | 0 | 0 | 0.1667 | 0.1667 | 0.1667 | 0.0417 | 0.0417 | 0.0417 | reg oper account | 0 | block of flats | 0.0650 | 0.0661 | 0.0665 | 0 | 0 | 0.0782 | 0.0795 | 0.0854 | 0.0887 | 0.0903 | 0.0924 | Cash loans | Secondary / secondary special | Married | House / apartment | Pensioner | Unaccompanied | 0.0193 | 0.0194 | 0.0195 | 0.0206 | 0.0210 | 0.0218 | 0.0 | 0.0 | NaN | XNA | 16.0 | 0 | 0 | 0 | 0 | 0.018029 | 3 | 3 | 174105 | 0.0743 | Panel | FRIDAY | 0.9801 | 0.9801 | 0.9801 | 0.7280 | 0.7316 | 0.7387 |
9997 | 17770.5 | 345510.0 | 247500.0 | 135000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 1 | F | NaN | NaN | NaN | -23632 | 365243 | -4572 | -796 | -11648 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.631941 | 0.713631 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | N | Y | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 14 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Single / not married | House / apartment | Pensioner | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | NaN | XNA | NaN | 0 | 0 | 0 | 0 | 0.035792 | 2 | 2 | 255204 | NaN | NaN | WEDNESDAY | NaN | NaN | NaN | NaN | NaN | NaN |
9998 | 53095.5 | 1706400.0 | 1350000.0 | 315000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0619 | 0.0625 | 0.0630 | 0.0498 | 0.0498 | 0.0517 | 2 | 4 | F | NaN | NaN | NaN | -14127 | -278 | -4535 | -496 | -390 | 0.0 | 0.0 | NaN | NaN | NaN | No | 0.1379 | 0.1379 | 0.1379 | 0.234588 | 0.453255 | 0.743559 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | Y | N | 0 | 0 | 0.1667 | 0.1667 | 0.1667 | NaN | NaN | NaN | NaN | 7 | block of flats | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | 0.0318 | 0.0324 | 0.0331 | Cash loans | Secondary / secondary special | Married | House / apartment | Working | Unaccompanied | NaN | NaN | NaN | 0.0744 | 0.0760 | 0.0788 | 0.0 | 0.0 | Managers | Trade: type 7 | 2.0 | 0 | 0 | 0 | 0 | 0.018801 | 2 | 2 | 112126 | 0.0412 | Panel | SATURDAY | 0.9831 | 0.9831 | 0.9831 | NaN | NaN | NaN |
9999 | 30838.5 | 601470.0 | 450000.0 | 225000.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 4.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2 | 4 | F | NaN | NaN | NaN | -13799 | -1360 | -4316 | -1802 | -2482 | 1.0 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.373797 | 0.605333 | 0.780144 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | N | Y | 1 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 12 | NaN | NaN | NaN | NaN | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | Cash loans | Secondary / secondary special | Civil marriage | House / apartment | Working | Unaccompanied | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | 1.0 | Managers | Self-employed | NaN | 0 | 0 | 0 | 0 | 0.020246 | 3 | 3 | 347521 | NaN | NaN | MONDAY | NaN | NaN | NaN | NaN | NaN | NaN |