nosql

Best way to store large amounts of time series data? Relational database (SQL) or NoSQL route. Also additional python/pandas question inside

user1884295

2021年9月3日 02:25

so I'm working on a project and I'm sort of stuck as to how to store my data. I have a concept I want to propose but am unsure whether it is possible, if it is not I would appreciate any help pointing me in the right direction. So as I said im working on a small project, for this project I want to store 2, 2 dimensional arrays every 20 seconds or so and have the time (seconds from …

Topic: data sql pandas dataset nosql

Category: Data Science

How to learn noSQL databases and how to know when SQL or noSQL is better

2021年3月15日 21:14

I want learn about NoSQL and when is better to use SQL or NoSQL. I know that this question depends on the case, but I'm asking for a good documentation on NoSQL, and some explanation of when is better to use SQL or NoSQL (use cases, etc). Also, your opinions on NoSQL databases, and any recommendations for learning about this topic are welcome.

Topic: nosql

Category: Data Science

Is there an overview over recommender system architectures?

TobiasJakob

2021年3月12日 05:46

I want to learn more about the recommender system topic. I am very interested in the usage of different database systems for this use case. My problem is that I cannot find a good overview of different architectures of recommender systems, especially with the focus on the database part. Can someone help me out with a good reference or some own thoughts? Thanks a lot. As interesting as this topic is for me as hard it seems to get some …

Topic: recommender-system nosql databases machine-learning

Category: Data Science

Are there decisive leaders in programming with tabular data?

Monolithguy

2021年1月16日 04:08

What are the most effective bread-and-butter in-memory open source tabular data frameworks today? I have been working with tabular data for years with an in-house solution that integrates with Excel well, but falls short of many other expectations. I would like to (if possible/true) demonstrate that our solution has fallen behind the times. In other words, assuming an SQL-like platform is responsible for persistence of a data set, but cycle intensive calculations need to be performed on that dataset (E.g. …

Topic: data-table sql pandas nosql

Category: Data Science

Deal with huge amount of data

Learner

2020年2月21日 14:22

I'm writing to get advices about my project. I want to make recommander system for shop with some products. In fact i want to recommand to shop A to take item X because shop B sell this item and shops A and B are very similar. The "problem" here is the size of the data : i have around 5TB of raw data (about 8 000 000 000 lines) So it's very difficult to do something with huge data like …

Topic: mongodb python recommender-system nosql

Category: Data Science

The data in our relational DBMS is getting big, is it the time to move to NoSQL?

ePezhman

2019年9月7日 18:23

We created a social network application for eLearning purposes. It's an experimental project that we are researching on in our lab. It has been used in some case studies for a while and the data in our relational DBMS (SQL Server 2008) is getting big. It's a few gigabytes now and the tables are highly connected to each other. The performance is still fine, but when should we consider other options? Is it the matter of performance?

Topic: relational-dbms nosql

Category: Data Science

Running SQL-like queries over large schemaless JSON dataset in the cloud?

Richard

2017年7月5日 13:01

I've got about 5 million JSON files, about 50GB in total. They do not have a consistent schema (they're broadly the same format, but some have extra extension fields, some have missing fields, etc - the schema is quite complexly nested). I would like to run SQL-like queries across these files - e.g. finding the count of files with a certain property, finding the count of files where property is in a numeric or time range, etc. I have the …

Topic: json nosql bigdata

Category: Data Science

Seeking advice on database architecture -- given my problem, what tools should I learn?

generic_user

2017年6月15日 13:39

I'm a fairly experienced R user, but until now I haven't had a good reason to learn to use databases. Now I have a problem where I am dealing with model output that I need to save to disk, and then query for another process. If the data were smaller, I'd store everything in a list, with hierarchical elements. For example, if my object is called output.OLS: 1> summary(output.OLS) Length Class Mode SEP0307 3 -none- list SEP0308 3 -none- list …

Topic: sql r nosql databases

Category: Data Science

When a relational database has better performance than a no relational

Filipe Ferminiano

2017年6月5日 19:30

When a relational database, like MySQL, has better performance than a no relational, like MongoDB? I saw a question on Quora other day, about why Quora still uses MySQL as their backend, and that their performance is still good.

Topic: nosql performance bigdata databases

Category: Data Science

Data representation (NoSQL database?) for a medical study

Karel Macek

2017年6月5日 19:30

Problem description I have a data set about 10000 patients in a study. For each patient, I have a list of various measurements. Some information is scalar data (e.g. age), some information is time series of measurements, some other information can be even a bitmap. The individual record itself can be quite thick (10kB to 10MB). The data is to be processed practically in two steps: Preprocessing at the level of individual records (patients), i.e. to extract some features in …

Topic: mongodb nosql machine-learning

Category: Data Science

What is the Best NoSQL backend for a mobile game

Filipe Ferminiano

2016年12月9日 21:55

What is the best noSQL backend to use for a mobile game? Users can make a lot of servers requests, it needs also to retrieve users' historical records (like app purchasing) and analytics of usage behavior.

Topic: nosql performance

Category: Data Science

Any Master Thesis Topics related to NoSQL and Machine Learning or Business Intelligence?

John Newman

2016年4月5日 01:22

Im currently in the last year, and I want to do a masters thesis on a topic that has NOSQL and Machine Learning or Business Intelligence. In my topic i want for defintely NOSQL, so I want to add a complementary topic (machine learning or business intelligence) to it. From my research i know that NOSQL: provides a mechanism for storage and retrieval of data which is modeled in means other than the tabular relations used in relational databases. And …

Topic: nosql machine-learning

Category: Data Science

NoSQL vs SQL backend for semi structured data

aamir23

2016年3月19日 04:09

I have a corpus of job descriptions and another corpus of CVs of applicants. I plan to implement a matching system using machine learning algorithms, to find top 5 or top 10 applicants for each job description. Should I store the data in a document oriented NoSQL db (MongoDB) or stick to SQL. Given that the data I have is semi-structured at best, I feel a NoSQL db will offer more flexibility. I would appreciate opinions on this.

Topic: sql nosql databases

Category: Data Science

Uses of NoSQL database in data science

jithinjustin

2015年11月7日 07:51

How can NoSQL databases like MongoDB be used for data analysis? What are the features in them that can make data analysis faster and powerful?

Topic: mongodb nosql bigdata

Category: Data Science

Python interface to Titan Database

Sreejithc321

2015年5月27日 03:38

How can I connect to Titan database from Python ? What I understand is that Titan (Graph database) provides an interface (Blueprint) to Cassandra (Column Store) and bulb is a python interface to graph DB. Now how can I start programming in python to connect with titan DB? Is there any good documentation/tutorial available ?

Topic: python nosql databases

Category: Data Science

What is the difference between Hadoop and noSQL

рüффп

2015年5月18日 12:30

I heard about many tools / frameworks for helping people to process their data (big data environment). One is called Hadoop and the other is the noSQL concept. What is the difference in point of processing? Are they complementary?

Topic: apache-hadoop processing tools nosql

Category: Data Science

Data store for testing data products?

bobfet1

2015年5月13日 01:18

Is there a recommended approach for storing processed data for testing new data products? Basically, I'd like to have a system where a data scientist or an analyst could think of a new data product to present to users, do the data processing to create it, and then put it in a data store that our application can then access easily. What I'm not sure about is what kind of data store would be good for this type of "testing" …

Topic: sql nosql

Category: Data Science

Is this Neo4j comparison to RDBMS execution time correct?

blunders

2015年5月10日 21:18

Background: Following is from the book Graph Databases, which covers a performance test mentioned in the book Neo4j in Action: Relationships in a graph naturally form paths. Querying, or traversing, the graph involves following paths. Because of the fundamentally path-oriented nature of the datamodel, the majority of path-based graph database operations are highly aligned with the way in which the data is laid out, making them extremely efficient. In their book Neo4j in Action, Partner and Vukotic perform an experiment …

Topic: neo4j nosql databases

Category: Data Science

Can hadoop with Spark be configured with 1GB RAM

user4290511

2015年2月28日 13:28

I'm trying to set up a cluster (1 namenode, 1 datanode) on AWS. I'm using free one year trial period of AWS, but the challenge is, instance is created with 1GB of RAM. As I'm a student, I cannot afford much. Can anyone please suggest me some solution? Also, it would be great if you could provide any links for setting up multi cluster hadoop with spark on AWS. Note: I cannot try in GCE as my trial period is …

Topic: aws apache-hadoop nosql bigdata

Category: Data Science

is this a good case for NOSQL?

nassimhddd

2014年7月23日 07:01

I'm currently facing a project that I could solve with a relational database in a relatively painful way. Having heard so much about NOSQL, I'm wondering if there is not a more appropriate way of tackling it: Suppose we are tracking a group of animals in a forest (n ~ 500) and would like to keep a record of a set of observations (this is a fictional scenario). We would like to store the following information in a database: a …

Topic: nosql databases

Category: Data Science

About