41 Commits

Author SHA1 Message Date
4fc2f0c925 extract weighting function 2017-11-10 10:18:13 +01:00
903e81c931 remove identical parameter from data loading function; add runs argument 2017-11-07 20:47:41 +01:00
88e3eda595 refactor hyperband; fix domain generation
integrate hyperband option in training procedure - start refactoring - remove the index erro in generation and add helper functions
2017-11-04 12:47:08 +01:00
8b17bd0701 add TSNE embedding; server evaluation visualization 2017-10-19 17:39:37 +02:00
6fef2b8b84 refactor all visualization for pauls changes - evaluate on max windows per users 2017-09-08 22:59:55 +02:00
9a51b6ea34 refactor test function working on full unfiltered data 2017-09-08 19:10:23 +02:00
edc75f4f44 refactor dataset creation, split up functions 2017-09-08 17:11:13 +02:00
70d00efb01 refactor using joblib for test results, make h5py store/load more flexible 2017-09-08 13:55:13 +02:00
3f6779fa3d load names with data for per-user evaluation 2017-09-02 16:02:48 +02:00
dc9180da10 refactor visualization, change arguments for model type and its depth 2017-09-01 10:42:26 +02:00
933eaae04a change exception type in get_flow_per_user function and replace index to new range index 2017-08-31 13:49:33 +02:00
6e7dc1297c fix lazy domain loading and generation process 2017-08-03 12:27:17 +02:00
7f1d13658f store domain embeddings while test main 2017-08-03 09:08:24 +02:00
f4da147688 refactor cmd argument to have single value for mode 2017-07-30 15:49:37 +02:00
ebaeb6b96e move vocab_size into implementation (not user dependent) 2017-07-30 13:47:11 +02:00
d97785f646 replace softmax by sigmoid in final layer, also adjust dataset for that 2017-07-30 12:50:26 +02:00
b0da2de0ea move utils functions to new file 2017-07-29 19:47:02 +02:00
820a5d1a4d add new network architecture - server label moves to the middle 2017-07-29 19:42:36 +02:00
2593131e9e add embedding visualization and domain encoding generator 2017-07-29 10:43:59 +02:00
18b60e1754 add extended test mode for embeddings 2017-07-17 19:30:56 +02:00
79fc441fe1 wip 2017-07-17 08:44:58 +02:00
d33c9f44ec fix chunks per user function bug caused by numpy version of array_split 2017-07-16 18:49:14 +02:00
844494eca9 add multi-threading for pre-processing 2017-07-16 09:42:52 +02:00
b35f23e518 add visualization for training curves, pr, roc 2017-07-14 14:58:17 +02:00
2afaccc84b refactor argparser into separate file, add logger 2017-07-12 10:25:55 +02:00
9f0bae33d5 refactor dataset generation, add callbacks 2017-07-11 21:06:58 +02:00
a196daa895 add simple flow feature extraction function 2017-07-11 13:46:25 +02:00
522854ee0d add h5 support for pauls best config main 2017-07-11 11:12:03 +02:00
41b38de1ab add feature: generate and use h5 data 2017-07-09 23:58:08 +02:00
fdc03c9922 add h5py example 2017-07-08 17:46:07 +02:00
4a9f94a029 add output for main_test 2017-07-08 15:04:58 +02:00
933f6bf1d7 add feature to use both hits information from dataset 2017-07-06 16:27:47 +02:00
b2f5c56019 refactor dataset generation 2017-07-05 21:19:19 +02:00
772b07847f WPI 2017-07-05 19:16:03 +02:00
a70d1cb03a fix: replace X_tr by its elements; choose selected samples for training data too 2017-07-05 18:37:29 +02:00
5743127b7f new dataset format: multi-lists -> two arrays 2017-07-04 09:18:50 +02:00
c972963a19 network predicts 2 by 2 classes, refactored threshold to main 2017-06-30 18:43:50 +02:00
8334e9a84f removed ys from training data generation 2017-06-30 17:42:18 +02:00
d19036a611 added pauls extensions for new predictions 2017-06-30 17:19:04 +02:00
7ae68cc30e WIP 2017-06-30 10:42:21 +02:00
bbd63fd1da separating logical sections into dataset, models and main.
continued initial refactoring
2017-06-30 10:12:20 +02:00