google-cloud-platform

Why model trains slower on GCP than on my local machine?

Stiefel

2022年3月19日 10:52

I'm using tensorflow-cloud and train a 3D voxel CNN. My local machine: NVIDIA GeForce RTX 2080 Ti 11GB, Intel Core i7 3GhZ, 32 GB RAM This is my machine config on tfc: tfc.MachineConfig(cpu_cores=8, memory=30, accelerator_type=tfc.AcceleratorType.NVIDIA_TESLA_T4, accelerator_count=1), To me this looks comparable. However, the training job takes 2-3 times as long as on my local machine. Do I share the cloud machine with other training jobs? Also the the job might be IO limited, on my local machine my training set …

Topic: google-cloud-platform tensorflow

Category: Data Science

Best way to represent a version feature based on percentiles

Gabriel Ballesteros

2022年2月25日 19:05

We're training a binary classifier in AutoML, and one of the features consist of browser versions. Currently these versions are provided "normalized" to the model, according to the percentile of the browser the current observation falls into. For example, if the percentiles of some specific browser versions are: percentile version p25 34 p50 45 p75 53 p99 70 then an observation with said browser and version=54 would be represented as: p25 p50 p75 p99 1 1 1 0 My question …

Topic: binary-classification google-cloud-platform automl feature-construction feature-extraction

Category: Data Science

Query google trend using google BigQuery

Alvin masievi

2021年12月11日 18:37

I need help with google BigQuery. Am using big query to query data from Google Trends. now I want to get data using a specific keyword example spiderman, and get the result in regions like CSV downloaded in google trend "interest over time". But google trend has this code only view 25 top-trending terms SELECT * FROM `bigquery-public-data.google_trends.top_terms` WHERE refresh_date = DATE_SUB(CURRENT_DATE(), INTERVAL 1 DAY) I want to use same syntax to get data for a specific term/keyword.

Topic: google-bigquery google-cloud-platform

Category: Data Science

TF-IDF for 400,000+ unique words in corpus?

user16584277

2021年9月21日 20:09

I have a corpus with over 400,000 unique words. I would like to build a TF-IDF matrix for this corpus. I have tried doing this on my laptop (16GB RAM) and Google Colab, but am unable to do so due to memory constraints. What is the best way to go about this?

Topic: google-cloud-platform tfidf memory nlp

Category: Data Science

Feature set choice in Google's Vertex AI/AutoML

user2268997

2021年7月23日 21:11

This is a subjective question on utilizing Vertex AI/AutoML in practice. I posted it on stackoverflow and it was closed. I hope it is within scope here. I'm using Google's Vertex AI/AutoML's Tabular dataset models to learn a regression problem on structured data with human engineered features - it's a score/ranking problem and the training target values are either 0 or 1. Our constructed features are often correlated, sometimes the same data point normalized on different dimensions, e.g. number of …

Topic: google-cloud-platform automl feature-selection

Category: Data Science

How to schedule importing data files from SFTP server located on compute engine instance into BigQuery?

Hamza

2021年3月28日 06:32

What I want to achieve: Transfer hourly coming data files onto a SFTP file server located on a compute engine VM from several different feeds into Bigquery with real-time updates effectively & cost-efficiently. Context: The software I am trying to import data from is an old legacy software and does not support direct exports to cloud. So direct connection from software to cloud isn't an option. It does however support exporting data to a SFTP server. Which is not available …

Topic: google-bigquery google-cloud-platform data-engineering etl

Category: Data Science

How can I create a VM instance with GPUs on Google Cloud Platform?

Franck Dernoncourt

2020年9月20日 01:26

How can I create a VM instance with GPUs on Google Cloud Platform? When I go to https://console.cloud.google.com/compute -> CREATE INSTANCE, I only see CPUs and no GPUs, as shown in the video below. I did select a region+zone that is supposed to have GPUs according to https://cloud.google.com/compute/docs/gpus https://cloud.google.com/compute/docs/gpus (mirror): I see that some VMs from the marketplace comes with GPUs but I'd prefer to configure the VM myself.

Topic: google-cloud-platform gpu

Category: Data Science

Why model trains slower on GCP than on my local machine?

Best way to represent a version feature based on percentiles

Query google trend using google BigQuery

TF-IDF for 400,000+ unique words in corpus?

Feature set choice in Google's Vertex AI/AutoML

How to schedule importing data files from SFTP server located on compute engine instance into BigQuery?

How can I create a VM instance with GPUs on Google Cloud Platform?

About