I have no problem importing Excel-formatted data into R/RStudio and using it with the other R packages I rely on. But when I want to use the glmnet package to develop a regularization model, I invariably run into the following error (after specifying my regularization model and attempting to run it): Error in storage.mode(y) <- "double": (list) object cannot be coerced to type 'double' Here is what I have already tried to resolve this: De-format the numbers in Excel (no …
The problem I want to solve is how to allocate garage spaces in my residential building. The parking spaces will be distributed randomly. I thought that it would be better if each person wrote down which spaces they want in order of preference, and then the order in which people pick a parking space is randomized. For instance: Person a chooses: p3, p5, p1, p2, p4 Person b chooses: p3, p1, p2, p4, p5 Person c chooses: p1, p3, p2, p5, p4 Person d …
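One possible way to implement the scheme described above is what is sometimes called random serial dictatorship: shuffle the residents, then let each one take their highest-ranked space that is still free. The sketch below is only an illustration under that assumption. The function name assign_spaces and the seed parameter are made up for this example, and only persons a, b and c are included because the remaining preference lists are cut off in the question.

```python
import random


def assign_spaces(preferences, seed=None):
    """Shuffle the people, then give each their best still-free space.

    preferences maps a person to a list of spaces, most preferred first.
    Returns the random picking order and the resulting assignment.
    """
    rng = random.Random(seed)  # seed is optional, only for reproducibility
    order = list(preferences)
    rng.shuffle(order)

    taken = set()
    assignment = {}
    for person in order:
        for space in preferences[person]:
            if space not in taken:
                assignment[person] = space
                taken.add(space)
                break
    return order, assignment


# Preference lists copied from the question (persons a, b and c only).
preferences = {
    "a": ["p3", "p5", "p1", "p2", "p4"],
    "b": ["p3", "p1", "p2", "p4", "p5"],
    "c": ["p1", "p3", "p2", "p5", "p4"],
}

order, assignment = assign_spaces(preferences, seed=0)
print("picking order:", order)
print("assignment:", assignment)
```

Whoever is drawn first always gets their first choice, and nobody can end up with a space that someone earlier in the order already took.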
I don't really know if I'm posting in the right place, but I hope so. I'm writing my bachelor's thesis, and I collected some data about fonts and language support. At the moment, all my data is stored in an Excel spreadsheet, in the form of: Column: language support (e.g. Cyrillic, Cyrillic extended, Arabic, Vietnamese, Devanagari, etc.) Rows: names of the fonts that support that language, up to 890 Note that some of these fonts, even though they occur only …
I am facing an issue with an Excel file. I have an Excel sheet with 2 columns. Column A: time, incrementing per second. Column B: a particular value of a machine sensor. The problem I am facing is that when the machine is stopped (not in motion), the depth increment stops for that period and no entries are made in the Excel sheet, and once the machine starts moving again it adds the entries for the starting …
I am trying to predict scored labels using regression. But when I try to get the result from the Azure ML Web Service in Excel 2016, no result appears in the scored label column. How should I fix this? Below is my whole process... Here is the problem I always get. As you can see, there is no result in the scored label column when I try to predict.
I am trying to write data ("log" kind of data) to an Excel sheet in MS SharePoint. I have previous experience writing to Google spreadsheets using Python. However, I don't know how to write directly to SharePoint's Excel sheet. If someone can help me out with that, it would be great! TIA :)
I have two separate datasets (e.g. dataset1.xlsx and dataset2.xlsx). Dataset1 has 2 columns: serial number and service address. Similarly, dataset2 has 2 columns: service address and customer number. The problem is that the addresses in both datasets have spelling errors. For example, one of the addresses in dataset1 is 790 spring ln, york, while dataset2 has the same address as 790 spring lane, york. So there is a difference in spelling between the two columns. I am …
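For spelling differences like the "ln" versus "lane" example above, one option that needs no extra packages is approximate string matching with difflib from the Python standard library. This is only a rough sketch: the helper names normalize and best_match, the cutoff of 0.8, and the second candidate address are assumptions made for the illustration, not something from the question.

```python
import difflib


def normalize(address):
    """Lower-case, drop commas and collapse repeated whitespace."""
    return " ".join(address.lower().replace(",", " ").split())


def best_match(address, candidates, cutoff=0.8):
    """Return the candidate most similar to address, or None if none is close."""
    by_normalized = {normalize(c): c for c in candidates}
    hits = difflib.get_close_matches(
        normalize(address), list(by_normalized), n=1, cutoff=cutoff
    )
    return by_normalized[hits[0]] if hits else None


# First address is from the question; the second is a hypothetical distractor.
dataset2_addresses = ["790 spring lane, york", "12 oak street, york"]

match = best_match("790 spring ln, york", dataset2_addresses)
print(match)
```

The cutoff is a trade-off: a lower value tolerates more typos but also produces more false matches, so it is worth checking a sample of the matched pairs by hand.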
I have a dataset A and a dataset B. Both have a column in common, the ID. I want to check whether dataset A's ID values can be found among dataset B's ID values, and also get the records of dataset B that have the same ID as dataset A. I can do this in Excel using VLOOKUP, but I want to automate the process using Python. NOTE: 1. dataset B has 10020 rows while dataset A has 2039 …
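A lookup like the one described above could be sketched in pandas with Series.isin. The two small frames below are invented stand-ins for dataset A and dataset B, and the "value" column is hypothetical; only the shared "ID" column comes from the question.

```python
import pandas as pd

# Hypothetical stand-ins for the two datasets; only "ID" is from the question.
dataset_a = pd.DataFrame({"ID": [1, 2, 3, 4]})
dataset_b = pd.DataFrame({"ID": [2, 4, 5], "value": ["x", "y", "z"]})

# Flag each ID of dataset A that also occurs in dataset B.
dataset_a["in_b"] = dataset_a["ID"].isin(dataset_b["ID"])

# Keep the records of dataset B whose ID occurs in dataset A.
matched_b = dataset_b[dataset_b["ID"].isin(dataset_a["ID"])]

print(dataset_a)
print(matched_b)
```

If the columns of both datasets are needed side by side, dataset_a.merge(dataset_b, on="ID", how="left") would be the closer analogue of VLOOKUP.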
I need to find all duplicate rows (string values) in the "Name" column and then find out whether two numerical values in the "Amount" column sum up to a third value, also in the "Amount" column, in an Excel tab using Pandas (Python). There are two tabs in this workbook. I'm referring to the second tab called "Table2". For example, in the table below, I have several duplicates in the "Name" column. But for "Richard Madden" duplicates, corresponding values in "Amount" table …
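The check described above (group by name, then look for two amounts that add up to a third) can be sketched in plain Python before moving it into pandas. The amounts below are invented, since the table in the question is cut off, and the function name pairs_summing_to_third is hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def pairs_summing_to_third(rows):
    """For each repeated name, list (x, y, z) where x + y == z.

    rows is an iterable of (name, amount) tuples. x, y and z always come
    from three different rows of the same name.
    """
    amounts_by_name = defaultdict(list)
    for name, amount in rows:
        amounts_by_name[name].append(amount)

    result = {}
    for name, amounts in amounts_by_name.items():
        if len(amounts) < 3:
            continue  # need at least three rows for a pair plus a third value
        hits = []
        for (i, x), (j, y) in combinations(enumerate(amounts), 2):
            for k, z in enumerate(amounts):
                if k not in (i, j) and x + y == z:
                    hits.append((x, y, z))
        if hits:
            result[name] = hits
    return result


# Invented example rows; only the name "Richard Madden" is from the question.
rows = [
    ("Richard Madden", 100),
    ("Richard Madden", 250),
    ("Richard Madden", 350),
    ("Jane Doe", 40),
    ("Jane Doe", 60),
]

found = pairs_summing_to_third(rows)
print(found)
```

With real data read through pandas.read_excel(..., sheet_name="Table2"), the rows could be produced with zip(df["Name"], df["Amount"]). For non-integer amounts, the exact equality test would need to be replaced by a tolerance check such as math.isclose.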
I have some percentage values in my Excel spreadsheet like: When I use the STDEV function to calculate the standard deviation of these values, I get 0,125. How should I interpret this? Is it +- 0,125% or +-12,50%?
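A small numeric check may help with the interpretation. Excel stores a cell formatted as a percentage as a fraction (12,5% is held as 0,125), so if the inputs are percentage-formatted cells, a STDEV of 0,125 would correspond to 12,5 percentage points. The values below are invented for the illustration, since the original ones are not shown, and the code uses the sample standard deviation, which is what STDEV computes.

```python
import statistics

# Hypothetical inputs: 25%, 37.5% and 50% as Excel stores them internally.
values = [0.25, 0.375, 0.50]

sd = statistics.stdev(values)   # sample standard deviation, as a fraction
sd_points = sd * 100            # the same spread in percentage points

print(sd, sd_points)
```

If the cells instead hold plain numbers such as 25 and 37,5 without percentage formatting, the result is already in percentage points and should not be multiplied by 100.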
So here is my data. For all four segments I want to build a single diagram which contains all four parameters for each segment. How can it be done in Excel or Google Sheets?
I would like to find some good courses, but also a quick answer on how to build a transition matrix from a sequence of states. Imagine having 4 states and the following array [1,2,4,1,3,4,2 etc etc]. What calculations are possible with only an array of states? You can make the array as long as you want; I just gave a random example. Python, Excel, or blog solutions are welcome.
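One calculation that needs only the array of states is the empirical transition matrix: count how often state i is followed by state j, then divide each row by its total. The sketch below assumes the states are numbered 1 to 4 as in the question and uses only the seven values that are actually listed; the function name transition_matrix is made up for this example.

```python
from collections import Counter


def transition_matrix(states, n_states):
    """Estimate P[i][j] = share of visits to state i+1 followed by state j+1.

    States are assumed to be labelled 1..n_states. Rows for states that are
    never left (no outgoing transition observed) are filled with zeros.
    """
    counts = Counter(zip(states, states[1:]))  # consecutive pairs
    matrix = []
    for i in range(1, n_states + 1):
        row_total = sum(counts[(i, j)] for j in range(1, n_states + 1))
        row = [
            counts[(i, j)] / row_total if row_total else 0.0
            for j in range(1, n_states + 1)
        ]
        matrix.append(row)
    return matrix


states = [1, 2, 4, 1, 3, 4, 2]  # the values listed in the question
P = transition_matrix(states, 4)
for row in P:
    print(row)
```

With so few observations the estimates are very rough; a longer sequence gives more reliable row probabilities.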
I have a dataset with a time column and a separate date column in .xlsx format. time column has values in the below format: 12:32:21.499145197 12:32:21.499145197 date column has values in the below format: Apr 10, 2018 Apr 10, 2018 When I read the Excel file in Python, both get object datatype. So I first correct the datatypes. For date I use below code: df["dateConv"]= pd.to_datetime(df["date"]) df["dateConv"] I am unable to correct the datatype for the time column. I tried …
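One way to handle a time-of-day column with nanosecond precision is to parse it as a duration with pd.to_timedelta and add it to the parsed date. This is a sketch under the assumption that the time column arrives as strings and is called "time" (the question does not show its name); "timeConv" and "timestamp" are names invented for the example.

```python
import pandas as pd

# Sample values copied from the question; the column name "time" is assumed.
df = pd.DataFrame({
    "date": ["Apr 10, 2018", "Apr 10, 2018"],
    "time": ["12:32:21.499145197", "12:32:21.499145197"],
})

df["dateConv"] = pd.to_datetime(df["date"])       # as in the question
df["timeConv"] = pd.to_timedelta(df["time"])      # offset since midnight
df["timestamp"] = df["dateConv"] + df["timeConv"]  # full date and time

print(df.dtypes)
print(df["timestamp"].iloc[0])
```

If Excel hands the time cells over as datetime.time objects rather than strings, converting them first with df["time"].astype(str) may be needed before calling pd.to_timedelta.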
I am pretty new to Python and Pandas and I struggle with combining a messy dataframe from Excel with a mapping. I have tried to find some solutions on the Internet, but with no success. My first df_1 is as follows: Product Name Val_1 Val_2 Val_3 Val_4 Prod_1 Level 1 High Yes Prod_1 Low No Level 2 Prod_2 Ab Standard No Prod_2 Bc Non Standard Prod_2 Non Standard Yes Bc Prod_3 High Standard Prod_3 a Complex Low As you can …
I am trying to delete a column from my csv file (column 'A', called "Film Number"). I have tried numerous variations of code, and while it deletes the column in the DataFrame, it doesn't do so in the actual Excel or csv file: it just deletes the column name but not the entire column. I'm using PyCharm. Here's the code. Any insights or help would be greatly appreciated. import pandas as pd # create a dataframe from the csv file …
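Since the code in the question is cut off, the following is not a fix for it, only a minimal sketch of the usual pattern: dropping a column changes the DataFrame in memory, and the file changes only when the frame is written back. The file name films.csv and the "Title" column are invented for the example; only "Film Number" is from the question.

```python
import os
import tempfile

import pandas as pd

# Build a small throwaway csv file so the example is self-contained.
csv_path = os.path.join(tempfile.mkdtemp(), "films.csv")
pd.DataFrame({"Film Number": [1, 2], "Title": ["A", "B"]}).to_csv(
    csv_path, index=False
)

# Read, drop the column (header and values together), and write back.
df = pd.read_csv(csv_path)
df = df.drop(columns=["Film Number"])
df.to_csv(csv_path, index=False)  # index=False avoids adding an index column

print(pd.read_csv(csv_path))
```

Passing index=False matters here: without it, pandas writes the row index as an extra unnamed first column, which can look as if the old column were still present without its header.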
I have the following values in an Excel sheet: Aguascalientes Baja California Baja California Sur Campeche Chiapas Chihuahua Ciudad de México Coahuila Colima Durango Estado de México Guanajuato Guerrero Hidalgo Jalisco Michoacán Morelos Nayarit Nuevo León Oaxaca Puebla Querétaro Quintana Roo San Luis Potosí Sinaloa Sonora Tabasco Tamaulipas Tlaxcala Veracruz Yucatán Zacatecas When I upload the file in Orange, it recognizes the column as text and not as categorical. Is there any reason why these values cannot be identified as categorical?
Consider an Excel sheet containing a matrix of correlations between individual stocks and the combined portfolio as a whole: How can I extract an optimized matrix such that most stocks have a low correlation to each other? If I simply go by the combined column, when I remove the ones that don't fit the criteria, I have a feeling it might not be optimal, as the combined column is affected by the ones that were removed.
I have some real time sales and revenue data for a retailer for each day across customer type, state and channel. Sample data shared below: Sales Date Region State Customer Type Sales Channel 1/1/2015 South East Florida Employed 100 Kiosk 1/2/2015 South East Georgia Non-Employed 200 Dotcom 1/3/2015 South East Florida Employed 300 Dotcom 1/4/2015 South West Arizona Non-Employed 400 Kiosk 1/5/2015 South West Georgia Employed 500 Dotcom 1/7/2015 South West Texas Non-Employed 0 Kiosk Revenue Date Region State Customer …
I have a data set in the following form: Product | Date 123 | 2019-01-01 456 | 2019-01-01 123 | 2019-01-02 123 | 2019-01-03 456 | 2019-01-03 123 | 2019-01-04 456 | 2019-01-04 789 | 2019-01-04 This is just a simplified version. The full set has ~300 products and four months of data. I would like to understand how the product set changed over time. It is obviously easy to calculate the count per day and see that I lost …
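Beyond daily counts, one simple way to see how the product set changed is to compare each day's set of products with the previous day's and list what was added and what was removed. The sketch below uses the sample rows from the question; the variable names are invented for the illustration.

```python
from collections import defaultdict

# Sample rows (product, date) copied from the question.
rows = [
    ("123", "2019-01-01"), ("456", "2019-01-01"),
    ("123", "2019-01-02"),
    ("123", "2019-01-03"), ("456", "2019-01-03"),
    ("123", "2019-01-04"), ("456", "2019-01-04"), ("789", "2019-01-04"),
]

# Collect the set of products seen on each date.
products_by_date = defaultdict(set)
for product, date in rows:
    products_by_date[date].add(product)

# Compare each day with the previous one (ISO dates sort correctly as text).
changes = []
previous = set()
for date in sorted(products_by_date):
    current = products_by_date[date]
    added = sorted(current - previous)
    removed = sorted(previous - current)
    changes.append((date, added, removed))
    previous = current

for date, added, removed in changes:
    print(date, "added:", added, "removed:", removed)
```

On the first date every product shows up as added, because there is no earlier day to compare against.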
I have multiple data frames with the same column names. I want to write them together to an Excel sheet, stacked vertically on top of each other, with a row of text between each. This is what I have in mind. I tried the pandas.ExcelWriter() method, but each dataframe overwrites the previous frame in the sheet instead of appending. Note that I still need multiple sheets for different dataframes, but also multiple dataframes on each sheet. Is …
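A common way to get this layout is to keep a single pandas.ExcelWriter open and pass an increasing startrow to each DataFrame.to_excel call, so that later frames land below earlier ones instead of on top of them. The sketch below is one possible arrangement, not the only one: plan_rows and write_stacked are hypothetical helper names, the titles and sample frames are invented, and the text row is written as a one-cell frame so the code does not depend on a particular Excel engine.

```python
import pandas as pd


def plan_rows(frames, gap=1):
    """Return (title_row, table_start_row) per frame, as 0-indexed sheet rows."""
    rows, row = [], 0
    for _title, df in frames:
        rows.append((row, row + 1))
        # title row + header row + data rows + blank rows before the next block
        row += 1 + 1 + len(df) + gap
    return rows


def write_stacked(writer, sheet_name, frames, gap=1):
    """Write (title, DataFrame) pairs below each other on one sheet."""
    for (title, df), (title_row, table_row) in zip(frames, plan_rows(frames, gap)):
        # The text row: a single cell written without header or index.
        pd.DataFrame([title]).to_excel(
            writer, sheet_name=sheet_name, startrow=title_row,
            header=False, index=False,
        )
        df.to_excel(writer, sheet_name=sheet_name, startrow=table_row, index=False)


# Invented sample data with identical column names.
frames = [
    ("First table", pd.DataFrame({"x": [1, 2], "y": [3, 4]})),
    ("Second table", pd.DataFrame({"x": [5, 6], "y": [7, 8]})),
]

layout = plan_rows(frames)
print(layout)
```

Writing would then look like opening one writer with pd.ExcelWriter("report.xlsx") in a with block (the file name is only an example) and calling write_stacked(writer, "Sheet1", frames) once per sheet, with a different sheet_name and list of frames each time. This needs an Excel engine such as openpyxl or xlsxwriter to be installed.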