TODO

✔️ = Done 🏃 = In Development 📆 = On the Agenda 🤔 = Not Sure

  • Data Structures
    • DataSet (3-D data structure) ✔️
    • Frame (2-D general data structure) ✔️
    • SeriesSet (2-D general data structure) ✔️
    • Matrix (2-D mathematical data structure) ✔️
    • Row (1-D general data structure) ✔️
    • Series (1-D general data structure) ✔️
    • TimeSeries (1-D time sequence data structure) 🏃
  • Statistics
    • Basic Statistics (mean, std, skewness, kurtosis, frequency, quantiles) ✔️
    • Correlation (Spearman & Pearson) ✔️
    • Analysis of variance ✔️
    • Compare Means (simple T-test, independent T-test) ✔️
  • Operations
    • Beautiful CRUD APIs (Create, Retrieve, Update, Delete) ✔️
    • Flexible I/O Tool (supporting multiple data sources for input and output) ✔️
    • Dummy Variables (automatically parse nominal variables into dummy variables) ✔️
    • Difference Sequence Data ✔️
    • Normalize Data (log, normal, standard, Box-Cox) ✔️
    • Drop Duplicate Records ✔️
    • Group By (analyze the dataset while controlling for a grouping variable) ✔️
  • Methods
    • LDA (Linear Discriminant Analysis) ✔️
    • LR (Linear Regression) ✔️
    • ANOVA (Analysis of Variance) ✔️
    • MLP (Multilayer Perceptron) ✔️
    • DT (Decision Tree) ✔️
    • K-Means 🏃
    • PCA (Principal Component Analysis) 🏃
    • ARIMA (Autoregressive Integrated Moving Average) 📆
    • SVM (Support Vector Machine) 🤔
    • Bayes Classifier 🤔
  • Others
    • Manual 🏃
    • Example Notebook 🏃
    • Unit Test 🏃