Questions tagged [machine-learning]

Implementation questions about machine learning algorithms. General questions about machine learning should be posted to their specific communities.

0
votes
0answers
3 views

PCA appliction on MNIST Dataset: Memory Error

I'm trying to apply the PCA algorithm to compress the mnist handwritten dataset in order to improove my Neural Network performance. So i wrote this function in python3: def pca(X,K): ...
-1
votes
0answers
5 views

In a data set which factors determine that data provision can be made using machine learning and/or other models for prediction?

I have a dataset that I whish to predict the Earthquakes that may happen or/and where exactly will they happen. The columns are these from the data set. -Year -Month -Date -Hours -Minutes -...
-3
votes
0answers
5 views

PROGRAM IS :- Predict the price of the google stocks for the next 15 days using the Quandl dataset.(Available here)

PROGRAM IS :- Predict the price of the google stocks for the next 15 days using the Quandl dataset.(Available here) ( in python )
0
votes
0answers
7 views

I need to make a dataset for a StackGAN,but I do not know how to link the image to text descriptions

I'm working on a text-to-image generating system, using StackGANs. I want to make my own dataset, but since a COCO format for 1000 images would take forever to make, I need a quick way to link text ...
0
votes
0answers
38 views

Is there something wrong with nested functions in Python? [on hold]

I tried to find some information about nested functions, but almost every article is just about their existence and how to use them. I am not really experienced in python, so it would be nice to get ...
-1
votes
0answers
6 views

How to implement Batch Normalization into DDPG?

So i got this DDPG implementation and i wanted to try to apply batch_normalization on it, but i can't figure out how to do it, it seems to be really difficult?! Could somebody help me out? DPPG Code:...
-3
votes
0answers
10 views

What's the best analytics tool for unstructured data? [on hold]

I have 100 TB of data including images, files, project folders in the NetApp Storage. Is there any tool to dig the data and grab the related information. For example, If I search 'burj khalifa', the ...
0
votes
0answers
41 views

decision tree implementation in JAVA

I'm trying to implement decision tree in JAVA from scratch based on the following dataset. I am struggling to think of how to represent a decisionNode and leafNode, like which properties to have and ...
0
votes
0answers
22 views

Is it valid to train the autoencoder before building the encoder/decoder models?

I am following the tutorial https://blog.keras.io/building-autoencoders-in-keras.html to build my autoencoder. For that, I have two strategies: A) Step 1: build autoencoder; Step 2: build encoder; ...
0
votes
0answers
10 views

Action Recognition for multiple objects and localization

I want to ask question regarding the action detection on the video with proposed frames. I've used Temporal 3D ConvNet for the action recognition on video. Successfully trained it and can recognize ...
-2
votes
0answers
11 views

How can i decide the processing power based on the dataset?

I learned from some articles on the internet says to train a model(in machine learning) computer needs more processing power then powerful CPU is needed, since it need large data set, it needs more ...
0
votes
0answers
15 views

SegNet: Why is training accuracy high but validation accuracy low?

I try to train a SegNet-Basic implemented in Keras on the Berkeley DeepDrive dataset. But after training the validation accuracy settles at about 50-60% while the training accuracy climbes to over 90%....
0
votes
1answer
28 views

Loss not decreasing - Pytorch

I am using dice loss for my implementation of a Fully Convolutional Network(FCN) which involves hypernetworks. The model has two inputs and one output which is a binary segmentation map. The model is ...
-1
votes
0answers
19 views

Lowest Scrabble Score Automated Program

I want to develop a program that tries different combinations of words in Scrabble to get as close as possible to the lowest possible total score: https://puzzling.stackexchange.com/questions/80939/...
0
votes
1answer
25 views

Distance Calculations for Nearest Mean Classifer

Greetins, How can I calculate how many distance calculations would need to be performed to classify the IRIS dataset using Nearest Mean Classifier. I know that IRIS dataset has 4 features and every ...
-2
votes
0answers
14 views

Seq2Seq using Automatic Sentence Encoder

I'm working on a problem where I need to predict the next sequence/label. I have a dataset of the type: [fam_1][text_grand_father][GF] [fam_1][text_ father][F] [fam_1][text_Mother][M] [fam_1][...
-4
votes
0answers
31 views

Poor accuracy in classification of well defined data [on hold]

I have a well defined data where i have cleaned up my data to final form which has 20 features mapping to a number between 1 to 100. Upto 5 features are enabled(value set to 1) for each row. The data ...
-3
votes
0answers
20 views

Different results for accuracy in the same dataset after applying SMOTE to text classification using Python

I am trying to solve the problem of imbalanced dataset using SMOTE in text classification while using TfidfTransformer and K-fold cross validation. I want to solve this problem by using Python code. ...
-1
votes
1answer
18 views

Binary column correlation with numeric column

Is that possible to check correlation between binary column and numeric column in Python, when data are contained in DataFrame? Is there any dedicated functionality to do this?
-1
votes
0answers
19 views

youtube recommendation, how nearest neighbor works for candidate generation?

I'm trying to understand https://storage.googleapis.com/pub-tools-public-publication-data/pdf/45530.pdf Their candidate generation step outputs topn items via softmax (with negative sampling) at ...
-1
votes
0answers
11 views

Hyperparameter Tuning for SVC

I've been trying to figure out the best way to tune parameters of rbf SVC. I know of nested cross-validation and their variants, but if you implement your own classifier, should you perform ...
-1
votes
0answers
12 views

How would I predict with Python RandomForestRegressor in the case of one or more missing inputs?

I am training a sklearn Random Forest Regression model in Python with a dataset composed of various features and none of the data is null. However, there are some cases when I may want to predict ...
0
votes
0answers
22 views

extracting image feature vectors in convolution neural network

i have an image dataset that consists of 376 classes . each class consists of 15 images for a specific person .class names (which are the labels) are strings from 000-377 . i want to get the feature ...
0
votes
0answers
40 views

How to implement the One-Hot Encoding?

I was doing some machine learing exercices, some data has qualitative variables, like sex:male, female. When build the model, we know the qualitative variables should be set to numbers, like male to ...
-3
votes
0answers
14 views

What is the best ML algorithm for frequently changing object identification in automation testing?

I am a tester working on selenium. I will perform automation on my web application using xpath. During developement developer changes the elements(tags, attributes) frequently. So i have to rewrite ...
-1
votes
1answer
38 views

Best pratices to select features in large dataset [on hold]

I have a large dataset with 276 features and 150 000 rows. Some of the features are categorical and several possible values or class (between 2 and 30 class). To predict a target variable (Y), I have ...
-1
votes
0answers
16 views

Encode ids/references columns [on hold]

I am currently working on a small dataset (~30k), with two skewed classes, (80/20). My problem is that my dataset is full of « ids » colums or  « references ». Like : City Neighbour ...
0
votes
0answers
9 views

Gensim doc2vec most similar gives unsupported operand type(s) error

I am using a pre-trained doc2vec model, when I try to find out most similar document to that of my sample document. It gives me unsupported operand type(s) error. from gensim.models import Doc2Vec ...
0
votes
1answer
19 views

How to solve logistic regression using gradient descent in octave?

I am learning Machine Learning course from coursera from Andrews Ng. I have written a code for logistic regression in octave. But, it is not working. Can someone help me? I have taken the dataset ...
1
vote
0answers
35 views

OutputProjectionWrapper vs fully connected layer on top of RNN

I'm reading the 14th chapter of Hands-On Machine Learning with Scikit-Learn and TensorFlow. It says: Although using an OutputProjectionWrapper is the simplest solution to reduce the dimensionality ...
0
votes
0answers
10 views

How to ensure the integrity of validation data in time series problem and set up a baseline properly

I am working on a time series problem where i have multiple possibly correlated datasets where my samples are timestamped and have a certain amount of lookback. My ultimate goal is to essentially ...
-1
votes
0answers
14 views

Automatic object cropping on a variety of images - building image dataset

I'm trying to build an image dataset from scratch, using images gathered from search engines. The problem I'm facing is that this objects usually are repeated in the images that are retrieved. What is ...
1
vote
0answers
21 views

Not able to interpret decision tree when using class_weights

I'm working with an imbalanced dataset. I'm using a decision tree (scikit-learn) to build a model. For explaining my problem I've taken iris dataset. When I'm setting class_weight=None, I understood ...
0
votes
2answers
40 views

Using Pandas Dataframe in TensorFlow - X and Y values

I'am trying to follow this tutorial: https://www.youtube.com/watch?v=G7oolm0jU8I&list=PLIivdWyY5sqJxnwJhe3etaK7utrBiPBQ2&index=3 But since he is importing with old tf functions and he is ...
-1
votes
0answers
12 views

Deployment of Azure ML Experiment as a Webservice through Visual Studio?

How to deploy azure ml experiment as a web service through visual studio
0
votes
0answers
17 views

Neural network regression with multi-value (probabilistic) functions

I'm a bit of a beginner in the art of machine learning. Here is a rather conceptual question I've been wondering: Suppose I have a function X->Y, say y=x^2, then, generating enough data of X->Y, I ...
-1
votes
0answers
27 views

How to make prediction in logistic regression

I have data - old investment advices from popular company: X = ['pref', 'reg', 'fut', 'cur',...] (only 4 clases - types of financial instruments. pref = preferred stock, reg = regular stock, fut = ...
-1
votes
0answers
16 views

Algorithm to detect outliers in network sensor messages

I have a network sensor device which generates a number of messages. The message is of format "timeofmessage messagetype messageimportance messagetext". The sensor keeps producing "sensor-ok" messages ...
0
votes
0answers
33 views

How can I use different length of input and output data in MLP using Keras or Tensorflow?

I wonder how can I input different length of data to multilayer perceptron and get different length of output data. Let's assume that I want to fit data separately (not in batches) in the way shown ...
-1
votes
0answers
25 views

Interfacing my machine learning code (through a REST API) to my app by using Volley [duplicate]

I am trying to learn how can I deploy my machine learning code that uses random forest classifier on iris data set to classify flowers. But I want the data for classification to be fed from my android ...
0
votes
0answers
26 views

How to use 1D CNN for Multivariate model

DataI am a newbie to machine learning.I have done some projects using univarate models. I am using following code to train model on stock prices.A time series prediciton of stock using 1D CNN. import ...
0
votes
0answers
36 views

ValueError: Error when checking input: expected lstm_1_input to have 3 dimensions, but got array with shape (6782, 36)

I tried to do human pose action recognition model, I referred this model I like to use LSTM for this model. So I have made some changes in train.py My train.py code: import pandas as pd from ...
-1
votes
0answers
26 views

Mean encoding categoricals for unseen values [on hold]

Pls correct me , is there a way to use the mean encoding to encode new UNSEEN values or should i again train the data with newly introduced values.? My dataset has lots of item names which are key ...
0
votes
1answer
22 views

How to pass test data to obtain model predictions if onehotencoder applied to train data

I am using Sklearn.preprocessing to preprocees (onehotencoder) the categorical data. onehotencoder = OneHotEncoder() pre_loc_data1 = onehotencoder.fit_transform(pre_loc_data1.astype(str)).toarray() ...
1
vote
0answers
16 views

Extract top words for each cluster

I have done K-means clustering for text data #K-means clustering from sklearn.cluster import KMeans num_clusters = 4 km = KMeans(n_clusters=num_clusters) %time km.fit(features) clusters = km.labels_....
0
votes
0answers
15 views

Derivation of InfoGAN function

I am confused as how the authors of InfoGan paper InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets (https://arxiv.org/pdf/1606.03657.pdf) have ...
2
votes
1answer
34 views

Machine Learning Gradient descent python implementation

Problem I have written this code, but this is giving errors: RuntimeWarning: overflow encountered in multiply t2_temp = sum(x*(y_temp - y)) RuntimeWarning: overflow encountered in ...
0
votes
0answers
22 views

How to correct the graph

Using the Logistic regression algorithm implemented by the Python Scikit-learn, classify the three types of flowers (Setosa, Versicolor, Virgin) in Iris dataset according to the Petal length and width....
0
votes
1answer
33 views

Handling categorical variables in sklearn with one-hot encoding

Can someone help with any existing Python class for categorical encoder for sklearn that ticks the following checkboxes? pandas friendly - option to return a dataframe should be able to drop 1 column ...
2
votes
0answers
17 views

How to load CIFAR-10 datasets using Pickle library on jupyter notebook?

This is what i did and nothing showing to be extracting. 1. downloaded this on my computer: https://www.cs.toronto.edu/~kriz/cifar-10-matlab.tar.gz 2. made a folder on jupyter notebook by name '...

http://mssss.yulina-kosm.ru