definitions

Origin of the Boolean Model of Information Retrieval

TiMauzi

2022年3月4日 15:00

Simple question, but I can't really find the answer to that: Who "invented" Boolean Retrieval? Of course, I assume that the concept grew over time, but is there a paper or publication that mentions/defines the Boolean Model as a whole for the first time? On Wikipedia, the book by Lancaster and Fayen (1973) is cited, but I couldn't find any definition in there, either.

Topic: history information-retrieval definitions

Category: Data Science

How would you explain Data Science to someone in simple layman terms?

Pluviophile

2022年2月15日 02:04

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from noisy, structured and unstructured data, and apply knowledge and actionable insights from data across a broad range of application domains. But if I want to explain it to someone in simple layman terms, I can't explain the concept of Data Science to a layman or grandma. Inspired by this post & the below quote You do not really understand something …

Topic: intuition definitions

Category: Data Science

How is Data Science related to Machine learning?

Subham Tripathi

2022年1月8日 08:20

I went through this comparison of analytic disciplines and this perspective of machine learning, but I am not finding any answers on the following: How is Data Science related to Machine learning? How is it not related to Machine Learning?

Topic: knowledge-base definitions machine-learning

Category: Data Science

Definition adjectives for clustering

Kate P

2021年12月12日 23:17

For a school project, I need to explain which clustering algorithm of Scikit-Learn we need to use based on the input data. The documentation is very well done, especially thanks to a comparative table of algos, but I have trouble understanding some adjectives. I would very much appreciate some definitions for the following terms : Flat / non-flat geometry Even / non-even cluster size Inductive / Transductive In addition, does "Not scalable" mean that the algorithm is not efficient when …

Topic: scikit-learn definitions

Category: Data Science

What is the difference between outlier detection and anomaly detection?

Martin Thoma

2021年12月4日 06:42

I would like to know the difference in terms of applications (e.g. which one is credit card fraud detection?) and in terms of used techniques. Example papers which define the task would be welcome.

Topic: terminology anomaly-detection outlier algorithms definitions

Category: Data Science

What is the name of this technique involving tracking cumulative errors with a forgiveness parameter?

Cain

2021年10月12日 23:08

I'm looking for the name of a technique I've seen used before. Most common in time-series based anomaly detection. It involves keeping a running total of consecutive "error" amounts, generally the difference from a prediction or baseline, and then reacting when the cumulative amount exceeds a specific tolerance level. There needs to be a "forgiveness" amount in this technique that reduces the cumulative error each iteration, to avoid a lot of small errors from eventually stacking up and flipping the …

Topic: anomaly-detection definitions

Category: Data Science

What's The Difference Between The Terms Predictor And Feature

r1d1

2021年10月7日 01:25

For the term 'predictor', I found the following definition: Predictor Variable: One or more variables that are used to determine or predict the target variable. Whereas Wikipedia contains the following definition of the word 'feature': Feature is an individual measurable property or characteristic of a phenomenon being observed. What is the difference between 'predictor' and 'feature' in machine learning?

Topic: definitions machine-learning

Category: Data Science

When are two neural networks independent from each other?

user112820

2021年3月4日 20:07

I want to know how you would define the independence of a neural network.

Topic: neural-network definitions

Category: Data Science

Intuitive explanation of Adversarial machine learning

Pluviophile

2021年2月25日 06:57

How would you explain Adversarial machine learning in simple layman terms for a non-STEM person? What are the main ideas behind Adversarial machine learning?

Topic: intuition adversarial-ml terminology definitions

Category: Data Science

What does anneal mean in the context of machine learning?

Reuben Walker

2020年11月22日 16:16

An article released by Open AI gives an overview of how Open AI Five works. There is a paragraph in the article stating: Our agent is trained to maximize the exponentially decayed sum of future rewards, weighted by an exponential decay factor called γ. During the latest training run of OpenAI Five, we annealed γ from 0.998 (valuing future rewards with a half-life of 46 seconds) to 0.9997 (valuing future rewards with a half-life of five minutes). Does annealing in …

Topic: openai-gym deep-learning definitions machine-learning

Category: Data Science

Is Data Science the Same as Data Mining?

demongolem

2020年8月16日 13:01

I am sure data science as will be discussed in this forum has several synonyms or at least related fields where large data is analyzed. My particular question is in regards to Data Mining. I took a graduate class in Data Mining a few years back. What are the differences between Data Science and Data Mining and in particular what more would I need to look at to become proficient in Data Mining?

Topic: definitions data-mining

Category: Data Science

What is difference between Standard Normal Distribution and Mean Normalization approaches to feature-scaling?

Subhash C. Davar

2020年8月2日 08:23

The tag feature-scaling seems to convey that one of the scaling methods is Standard Normal Distribution. Further, I read an Answer on this site saying that Mean Normalization is a form of feature scaling. What is the difference between two approaches to scaling? Note: I think that statistics and mathematics of normalization do differ.

Topic: normalization feature-scaling sampling statistics definitions

Category: Data Science

What does it mean when we say an algorithm/metric is agnostic

Mr.Robot

2020年7月29日 06:43

Problem I have all kinds of machine learning terms that co-occur with the word "agnostic", including model-agnostic learning, model-agnostic metric. From the dictionary, it explains the word "agnostic" in the following way a person who holds the view that any ultimate reality (such as God) is unknown and probably unknowable. which does not make those terms more understandable. In some contexts, I find that "agnostic" refer to "generic" or "free of". For example, in the paper I am reading now, …

Topic: definitions

Category: Data Science

Is it correct to define the F-measure as the harmonic mean of specificity and sensitivity in such a way?

Qubit

2020年3月2日 11:53

It is common to define the F-measure as a function of precision and recall, as mentioned in [1]: $F_{\beta}=\frac{(1+\beta^2)PR}{\beta^2 P+R}$ However I came across some other cases, another definition is used [2] (without weights): $F = H(sensitivity, 1- specificity)$ Where H is harmonic mean. Reference: F - measure derivation (harmonic mean of precision and recall) https://link.springer.com/chapter/10.1007/978-3-540-68947-8_133. https://stackoverflow.com/a/52892413/2243842

Topic: metric evaluation definitions

Category: Data Science

Is the search for a specific n-gram the same like a string search?

bartman99

2020年1月2日 22:15

Is the result of a search for a specific n-gram like sherlock+holmes equal to the result of a regex search for "sherlock holmes" in the same document corpus? So if i read about n-grams for certain words, that's the same like normal string search? Example: https://books.google.com/ngrams/ https://books.google.com/ngrams/info

Topic: ngrams regex search definitions

Category: Data Science

Is the _error_ in the context of ML always just the difference of predictions and targets?

lo tolmencre

2019年7月29日 13:38

Simple definitional question: In the context of machine learning, is the error of a model always the difference of predictions $f(x) = \hat{y}$ and targets $y$? Or are there also other definitions of error? I looked into other posts on this, but they are not sufficiently clear. See my comment to the answer in this post: What's the difference between Error, Risk and Loss?

Topic: machine-learning-model terminology definitions machine-learning

Category: Data Science

What is the difference between the value -99 and NaN in a data column?

panchester

2019年7月18日 02:35

I am new to data science. I was looking into some datasets and I saw some values like -99, which I discovered later that it means that there is a missing value. Does this mean the same thing as NaN? If it is the same thing, why do we use -99 instead of NaN?

Topic: missing-data beginner definitions

Category: Data Science

What is the relationship between AI and data science?

Martin Thoma

2019年4月24日 05:53

I think they share a lot (e.g. machine learning is a subset of both, right?), but maybe both have elements the other doesn't have? Could you name some in that case? Or is one a subset of the other? What is the relationship between AI and data science? For example, when it comes to the relationship of AI and ML, I always say AI is a superset of ML. And the distinguishing set is search algorithms, which I would include …

Topic: ai reference-request definitions

Category: Data Science

What fits in a Data Description Report/ Data Exploration Report?

user3839833

2019年4月12日 12:15

So I am trying to get familiar with Crisp-DM and found the terms "Data Description Report" and "Data Exploration Report", which seem oddly vague in their definition. So far I only found this right here: https://www.ibm.com/support/knowledgecenter/en/SS3RA7_15.0.0/com.ibm.spss.crispdm.help/crisp_data_description_report.htm But this seems to be on the shorter end in my opinion. Is there any example of a Data Description Report anywhere? If not, is there any systematic methodology you personally use to record your findings while trying to understand data?

Topic: data terminology dataset definitions

Category: Data Science

Why is finite precision a problem in machine learning?

GeorgeOfTheRF

2019年2月13日 06:26

Can you explain what is finite precision? Why is finite precision a problem in machine learning?

Topic: finite-precision terminology definitions machine-learning

Category: Data Science

About