Rukey - Alway believe that something wonderful is about to happen

「 AZKABAN 」 August 20, 2018

azkaban java api

Words count 884 Reading time 1 mins.

Azkaban

https://github.com/shirdrn/azk-client

https://github.com/Jacker-Wang/AzkabanHandle

https://github.com/runningdata/azkaban_client

https://github.com/SamPiy93/AzkabanJavaAPI

Read article

「 AMBARI二次开发」 August 20, 2018

Ambari多语言处理

Words count 31k Reading time 28 mins.

Ambari二次开发

在ambari-web/app下新建目录locales，然后在locales目录下创建zh、en目录，分别存放中文、英文语言js文件。

en下的message.js为英文语言，跟app目录下的message.js相同

zh下的使我们要汉化的语言。翻译相对应的字段即可。

1.1状态修改

修改modes/host_component.js 243行

    getTextStatus: function (value) {
        switch (value) {
            case this.installing...

Read article

「 AMBARI二次开发」 August 20, 2018

Ambari二次开发-实时编译-部署集成环境搭建

Words count 12k Reading time 11 mins.

Ambari二次开发

总体思路：对ambari-web、ambari-admin进行单独编译，利用brunch watch功能，对文件源码进行实时监控编译。ambari-web/public 里存放编译完成的文件，（这里的文件目录与部署ambari后 /usr/lib/ambari-server/web 里的相同），对/usr/lib/ambari-server/web 建立软链接到public，就能实现开发-编译-web可视化的功能。

开发人员只需走的流程：

1. 本地访问samba共享目录里的源码进行二次开发
...

Read article

「机器学习」 August 19, 2018

1-1 numpy数组合并

Words count 5.2k Reading time 5 mins.

import numpy as np

x = np.array([1,2,3])
y = np.array([3,2,1])

array([1, 2, 3])

array([3, 2, 1])

x与y合并

np.concatenate([x,y])

array([1, 2, 3, 3, 2, 1])

z = np.array([666,666,666])

np.concatenate([x,y,z])

array([  1,   2,   3,   3,   2,   1, 666, 666...

Read article

「机器学习」 August 19, 2018

1-3 numpy中的矩阵聚合操作

Words count 8k Reading time 7 mins.

import numpy as np

L = np.random.random(100)
L

array([0.25119175, 0.92977429, 0.99646455, 0.01350068, 0.10785383,
       0.13216408, 0.53743682, 0.2362282 , 0.43276904, 0.88723033,
       0.32363798, 0.7736106 , 0.30140572, 0.70171898, 0.23628926,
       0.11465989, 0....

Read article

「机器学习」 August 19, 2018

1-4 numpy中Fancy Indexing

Words count 4.5k Reading time 4 mins.

import numpy as np

x = np.arange(16)
x

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15])

x[3]

x[3:9]

array([3, 4, 5, 6, 7, 8])

x[3:9:2]

array([3, 5, 7])

[x[3],x[5],x[8]]

[3, 5, 8]

ind = [3,5,8]

x[ind]

array([3, 5, 8])

ind...

Read article

「机器学习」 August 19, 2018

1-2 numpy.array中的运算

Words count 10k Reading time 9 mins.

import numpy as np

给定一个向量，让向量中每一个数乘以2
a = (0,1,2)
a*2 = (0,2,4)

普通python中

n = 10

L = [i for i in range(n)]

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]

2 * L

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9]

A = []
for e in L:
    A.append(2*e)

[...

Read article

「机器学习」 August 19, 2018

2-6 k近邻算法总结

Words count 202 Reading time 1 mins.

最大的缺点：效率低下

如果训练集有m个样本，n个特征，则预测每一个新的数据，需要O(m*n)

优化，使用树结构：KD-Tree,Ball-Tree

缺点2：高度数据相关

缺点3：预测结果不具有可解释性

缺点4：维数灾难

随着维护的增加，“看似相近”的两个点之间的距离越来越大

解决方法：降维

Read article

「机器学习」 August 19, 2018

5-3 MNIST数据集

Words count 3.2k Reading time 3 mins.

import numpy as np
from sklearn.datasets import fetch_mldata

mnist = fetch_mldata("MNIST original",data_home="/Users/shirukai/Desktop/MachineLearn/datasets/")

mnist

{'COL_NAMES': ['label', 'data'],
 ...

Read article

「机器学习」 August 19, 2018

6-1 多项式回归

Words count 79 Reading time 1 mins.

Read article