I have a Python Flask application that connects to an Azure SQL Database and uses the pandas read_sql method with SQLAlchemy to run a select on a table and load it into a DataFrame: recordsdf = pd.read_sql(recordstable.select(), connection). The recordstable has around 5,000 records, and the function takes around 10 seconds to execute (I have to pull all records every time). However, the exact same operation with the same data takes around 0.5 seconds when I'm selecting …
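One way to narrow down where those 10 seconds go is to time the raw driver fetch separately from the pandas call. A minimal local sketch (the table name and columns are made up, and an in-memory SQLite table stands in for the Azure database; on 5,000 rows both paths should finish far under a second locally, which would point at the network or driver configuration rather than pandas):

```python
import sqlite3
import time

import pandas as pd

# Throwaway in-memory table with ~5,000 rows to time locally.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE records (id INTEGER, name TEXT, value REAL)")
conn.executemany(
    "INSERT INTO records VALUES (?, ?, ?)",
    [(i, f"name_{i}", i * 0.5) for i in range(5000)],
)

# Time the raw driver fetch on its own...
t0 = time.perf_counter()
rows = conn.execute("SELECT * FROM records").fetchall()
raw_seconds = time.perf_counter() - t0

# ...and then the same query through pandas.
t0 = time.perf_counter()
recordsdf = pd.read_sql("SELECT * FROM records", conn)
pandas_seconds = time.perf_counter() - t0

print(f"raw fetch: {raw_seconds:.4f}s, read_sql: {pandas_seconds:.4f}s, rows: {len(recordsdf)}")
```

If the raw fetch against the real database is already slow, the DataFrame construction is not the bottleneck.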
So I have a data set I have successfully used to train a model, with decent results. I am using a Two-Class Boosted Decision Tree for a boolean output. So far, so good. I now want to analyze each column of my data set and remove any column that does not have a meaningful influence on the outcome. I see statistics on the columns in my data set, but I don't see whether a column has a strong relationship with …
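Outside Studio, a common way to check whether a column meaningfully influences the outcome is permutation importance: shuffle one column at a time and measure how much the test score drops. A minimal sketch with scikit-learn on synthetic data (GradientBoostingClassifier as a rough stand-in for a two-class boosted decision tree; the data and column count are made up):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
# Only the first two columns actually drive the boolean label.
y = (X[:, 0] + 2 * X[:, 1] > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = GradientBoostingClassifier(random_state=0).fit(X_train, y_train)

# Shuffle each column in turn and measure how much the score drops.
result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)
for i, importance in enumerate(result.importances_mean):
    print(f"feature {i}: importance {importance:.3f}")
```

Columns whose importance hovers near zero are candidates for removal.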
I'm having trouble generating univariate time series forecasts with Azure Automated Machine Learning (I know...). What I'm doing: I have about 5 years' worth of monthly observations in a dataframe that looks like this:

date        target_value
2015-02-01  123
2015-03-01  456
2015-04-01  789
...         ...

I want to forecast target_value based on past values of target_value, i.e. univariate forecasting, like ARIMA for instance. So I am setting up the AutoML forecast like this: # that's the dataframe as shown above …
I am trying to predict scored labels using regression, but when I get the result from the Azure ML web service in Excel 2016, no result appears in the scored label column. How can I fix this? Below is my whole process. Here is the problem I always get: as you can see, there is no result in the scored label column when I try to predict.
I am using Microsoft Azure Machine Learning Studio to predict stock market prices. We have the variables: index price (the target, to be predicted), low price, high price, dates, and days. We use a 0.7 split and run linear regression, getting a mean absolute error of 109. We then try to add more variables (macroeconomic factors that positively affect the index price) which are correlated with the target variable and should improve the predictions, and we find that the mean absolute error increases to 110. I have attached the …
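For what it's worth, correlation with the target does not guarantee a lower test error: a feature measured with heavy noise, or one redundant with existing features, can raise the MAE. A minimal sketch with scikit-learn on synthetic data (all variable names and the noise levels are made up for illustration):

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)
n = 500
low = rng.normal(100, 10, n)
high = low + rng.normal(5, 2, n)
index_price = 0.5 * low + 0.5 * high + rng.normal(0, 3, n)  # target
# A macro factor correlated with the target but measured with heavy noise.
macro = index_price + rng.normal(0, 30, n)

X_base = np.column_stack([low, high])
X_extra = np.column_stack([low, high, macro])

def fit_mae(X):
    """Train on a 0.7 split and return the test MAE."""
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, index_price, train_size=0.7, random_state=0
    )
    model = LinearRegression().fit(X_tr, y_tr)
    return mean_absolute_error(y_te, model.predict(X_te))

mae_base = fit_mae(X_base)
mae_extra = fit_mae(X_extra)
print(f"base MAE: {mae_base:.2f}, with macro factor: {mae_extra:.2f}")
```

Comparing the two MAEs on held-out data, rather than the correlation alone, is what tells you whether the extra variables actually help.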
I am new to Azure ML. I am working on sentiment analysis of a small tweet dataset using fastText embeddings (the fastText file 'wiki-news-300d-1M.vec' is around 2.3 GB, which I downloaded into my folder). When I run the program in a Jupyter notebook, everything runs well. But when I try to deploy the model in Azure ML and attempt to run the experiment: run = exp.start_logging() run.log("Experiment start time", str(datetime.datetime.now())) I get the error message: While …
I am unable to pickle the class below. I am using Databricks 6.5 ML (includes Apache Spark 2.4.5, Scala 2.11).

```python
import pickle

class Person:
    def __init__(self, name, age):
        self.name = name
        self.age = age

p1 = Person("John", 36)
pickle.dump(p1, open('d.pkl', 'wb'))
```

PicklingError: Can't pickle <class '__main__.Person'>: attribute lookup Person on __main__ failed
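For reference, the same class round-trips fine in a plain Python session, which suggests the Databricks error is about how the notebook kernel exposes `__main__` rather than about the class itself. A minimal sketch (using an in-memory buffer instead of a file):

```python
import io
import pickle

class Person:
    """Pickling works when Person is resolvable by name in __main__."""
    def __init__(self, name, age):
        self.name = name
        self.age = age

p1 = Person("John", 36)

# Round-trip through an in-memory buffer instead of a file on disk.
buf = io.BytesIO()
pickle.dump(p1, buf)
buf.seek(0)
restored = pickle.load(buf)
print(restored.name, restored.age)  # John 36
```

In Databricks, common workarounds are to move the class definition into a .py module on the cluster and import it, or to serialize with cloudpickle, which pickles classes defined in `__main__` by value instead of by reference.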
Vineeth Sai indicated in this answer that the problem is solved with the following command: pip install cntk. However, I am getting the error shown in the attached image:
I have access to both Azure Machine Learning Studio and Azure Cognitive Services. Ideally, I'd like to export from Azure Cognitive Services any model that does a good job of detecting objects of a certain class in a picture, import that model into Models in Azure Machine Learning Studio, and then train it from scratch on my own dataset. My question is: is that possible? If the answer is 'no', then what would be the …
I'm trying to get TensorFlow running inside a Python script in Azure Machine Learning Studio. Since TensorFlow is not part of Azure Machine Learning Studio, I had to import it using a zip file. I followed the instructions here: https://stackoverflow.com/questions/44593469/how-can-certain-python-libraries-be-imported-in-azure-mllike-the-line-import-hu However, when trying to import TensorFlow, I get: ImportError: No module named _pywrap_tensorflow_internal Failed to load the native TensorFlow runtime. It seems TensorFlow is much more than just a Python library; it seems to need a native library …
Given a regression model with n features, how can I measure the uncertainty or confidence of the model for each prediction? For one specific prediction the accuracy may be amazing, while for another it is not. I would like to find a metric that lets me decide, for each frame, whether to "listen" to the model or not.
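One common approach is ensemble disagreement: train an ensemble and use the spread of the members' predictions as a per-sample uncertainty proxy (quantile regression is another option). A minimal sketch with a random forest on synthetic data (the data, sizes, and query points are made up):

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(400, 1))
y = np.sin(X[:, 0]) + rng.normal(0, 0.1, 400)

forest = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)

X_new = np.array([[0.0], [2.5]])
# Each tree gives its own prediction; the spread across trees is a
# per-sample uncertainty proxy.
per_tree = np.stack([tree.predict(X_new) for tree in forest.estimators_])
mean_pred = per_tree.mean(axis=0)
uncertainty = per_tree.std(axis=0)
for x, m, s in zip(X_new[:, 0], mean_pred, uncertainty):
    print(f"x={x:+.1f}: prediction {m:.2f} ± {s:.2f}")
```

Predictions whose spread exceeds a threshold you choose are the ones not to "listen" to.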
I am using Azure ML Studio AutoML to train a best time series model with the TCNForecaster algorithm and deploy it as a web service. Since this uses a deep learning algorithm, the request format is different from that of simpler algorithms, and I do not know how to enter my request data for forecasting (below). I have tried lots of ways but always get "error":"'date'". { "data": [ { "_automl_target_col_WASNULL": 0, "_automl_target_col_season": 0, "_automl_target_col_trend": 0, "_automl_year": 0, "_automl_half": 0, "_automl_quarter": 0, "_automl_month": 0, …
I am very new to machine learning. I just went through some of the Azure tutorials and completed one practice workflow (car price prediction). I hope I can ask basic questions here. Scenario: we receive service requests from our customers via email, with fields like customer name, user name, email ID, equipment affected, type of call, and issue experienced (a free-text area). An employee reads this email, mainly the issue experienced. Based on the issue experienced …
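A scenario like this (routing a ticket based on its free text) is usually framed as text classification. A minimal sketch with scikit-learn, assuming entirely made-up ticket texts and two hypothetical categories:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Tiny hypothetical "issue experienced" texts and their routing categories.
issues = [
    "printer not responding after power cycle",
    "cannot log in to the portal, password rejected",
    "screen flickers when the machine boots",
    "account locked out after too many attempts",
    "paper jam in tray two again",
    "forgot my password and reset link never arrives",
]
categories = ["hardware", "access", "hardware", "access", "hardware", "access"]

# TF-IDF turns free text into numeric features; logistic regression classifies.
classifier = make_pipeline(TfidfVectorizer(), LogisticRegression())
classifier.fit(issues, categories)

prediction = classifier.predict(["user cannot sign in, credentials rejected"])[0]
print(prediction)
```

A real system would need far more labeled examples per category, but the same pipeline shape carries over.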
I have made multiple merges using pandas DataFrames (see the example script below). The merges cause the DataFrame to explode and consume more memory: the records reach 18 billion in df3 before merging with the 500,000 (5 lakh) records in df4. This causes a memory issue: it consumes all 140 GB of RAM and the session gets killed.

```python
df = df1[df1_columns].\
    merge(df2[df2_columns], how='left', left_on='col1', right_on='col2').\
    merge(df3[df3_columns], how='left', on='ID').\
    merge(df4[df4_columns], how='left', on='ID')
```

Appreciate …
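Row counts that balloon like this usually mean the join key is duplicated on both sides, so each merge multiplies matching rows. A tiny sketch with made-up stand-ins for df3/df4 showing the effect, and one common mitigation (deduplicating the right side before merging; pandas' merge also accepts validate="many_to_one" to raise an error when the right side has duplicate keys):

```python
import pandas as pd

# Small stand-ins: 'ID' is duplicated on both sides, so a merge
# multiplies matching rows (3 x 2 = 6 rows for ID 1).
df3 = pd.DataFrame({"ID": [1, 1, 1, 2], "a": [10, 11, 12, 20]})
df4 = pd.DataFrame({"ID": [1, 1, 2], "b": [100, 101, 200]})

exploded = df3.merge(df4, how="left", on="ID")
print(len(exploded))  # 3*2 + 1*1 = 7 rows from only 4 on the left

# Deduplicating the right side first keeps one output row per left row.
bounded = df3.merge(df4.drop_duplicates("ID"), how="left", on="ID")
print(len(bounded))  # 4
```

Checking each intermediate merge's row count (or using validate=) pinpoints which join introduces the blow-up.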
Especially when considering GCP, Google's analytics offering is quite interesting. Why would you go with Databricks? GCP also has great integration between tools, as well as great support for ML/AI, etc.
I am building a crop recommender system using the Matchbox recommender in Azure ML Studio. When splitting the dataset using the Recommender Split option, it won't split; when I use Split Rows instead, it works. But when evaluating the recommender, it shows the error 'Test dataset contains invalid data'. How can I overcome this issue?
I've been trying to solve an issue with a piece of time data for a while now. I cannot convert it to DateTime using the Edit Metadata module, turn it into a numeric value, or bin it. However, every time I enter it into a model to be trained, the time values come back with underscored integers appended to them, such as _1, _2, _3, etc. They appear at the end of an otherwise normal value, for example 01/02/2021 08:33_1. …
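Those suffixes look like markers appended to distinguish repeated values, which then prevent datetime parsing. If the values can be preprocessed outside Studio (e.g. in a Python script step), one option is to strip a trailing _<digits> marker before parsing. A minimal sketch with pandas, assuming the day-first format shown in the question:

```python
import pandas as pd

# Hypothetical column with the underscored suffixes described above.
times = pd.Series(["01/02/2021 08:33_1", "01/02/2021 08:33_2", "03/02/2021 09:10_3"])

# Strip a trailing _<digits> marker, then parse as day-first datetimes.
cleaned = times.str.replace(r"_\d+$", "", regex=True)
parsed = pd.to_datetime(cleaned, format="%d/%m/%Y %H:%M")
print(parsed)
```

If the format is actually month-first, the format string would need to be "%m/%d/%Y %H:%M" instead.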
I am working in Azure ML Studio, trying to create a regression model to predict a numerical value. I will try to describe my features and what I have done so far. My data has about 3 million rows. Features:
- 8 integer features from 1 to 25
- 2 boolean features with 0 and 1
- 3 integer features from 1 to 10
- 2 integer features from 0 to 500,000 (and 1,000,000 respectively) with about 4,500 unique values
- 1 integer …
I presume the Azure ML Studio "Tune Model Hyperparameters" module performs cross-validation, since it shows "average test" metrics like accuracy and precision. However, I don't see a parameter for setting the number of CV folds, nor any info in the docs about what this "test" set is. In the pipeline, we are only providing training data (no validation/test set). So does the module perform CV by default? If so, what is the default number of folds? I understand how …
I have a small data set (4,000 records with 10 features), and I used XGBoost in R as well as the Boosted Decision Tree model in Azure ML Studio. Unfortunately, the results are different. I would like to optimize recall, and I can pick that as a measure in Azure but not in R. I used the same parameters on both platforms. I know the seeds might be different, but I tried many of them. I always have a …
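For comparison, optimizing recall directly is straightforward in scikit-learn via the scoring argument of a grid search (R offers analogous routes, e.g. a custom summary metric in caret or a custom eval in xgboost). A minimal sketch on synthetic data (GradientBoostingClassifier as a stand-in for a boosted decision tree; the data, class imbalance, and parameter grid are made up):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic imbalanced stand-in for the 10-feature data set.
X, y = make_classification(
    n_samples=1000, n_features=10, weights=[0.8, 0.2], random_state=0
)

# scoring="recall" makes the grid search select parameters by recall,
# mirroring choosing recall as the measure in Azure ML Studio.
search = GridSearchCV(
    GradientBoostingClassifier(random_state=0),
    param_grid={"n_estimators": [50, 100], "max_depth": [2, 3]},
    scoring="recall",
    cv=3,
)
search.fit(X, y)
print(search.best_params_, round(search.best_score_, 3))
```

Even with recall as the selection metric on both platforms, differing defaults (learning rate, tree depth, subsampling) can still make the two implementations disagree.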