
Questions tagged [python]

Use for data science questions related to the programming language Python. Not intended for general coding questions (which should be asked on Stack Overflow).

4 votes
1 answer
467 views

I'm currently working with a dataset in which profanity is censored. For example, "fuck" would appear as four heart emojis. Since I'm trying to run topic modelling with BERTopic, what kind of preprocessing would ...
Gabriel Fagundes
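One possible preprocessing step for the censored-profanity case above, as a minimal sketch: collapse each run of heart emojis into a single placeholder token so the topic model sees one consistent term instead of emoji noise. The set of heart code points, the minimum run length of two, and the `[CENSORED]` token are all assumptions for illustration; the actual dataset may use different characters.

```python
import re

# Assumption: censored words show up as runs of heart emojis.
# The character class covers U+2764 and U+1F493..U+1F49F, each optionally
# followed by the emoji variation selector U+FE0F. Adjust to the real data.
HEART = "[\u2764\U0001F493-\U0001F49F]\uFE0F?"
CENSORED_RUN = re.compile(f"(?:{HEART}){{2,}}")


def mask_censored(text, token="[CENSORED]"):
    """Replace each run of two or more heart emojis with one placeholder token."""
    return CENSORED_RUN.sub(token, text)
```

A single heart is left untouched because it is more likely a genuine emoji than a censored word; whether to keep, drop, or map the placeholder to a stop word depends on whether profanity should influence the topics.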
0 votes
0 answers
11 views

I'm training a CSDI model and the output looks very suspicious. Too few diffusion steps? Too high a learning rate? Nothing seems to change this behavior. Some normalization issue that I'm not accounting for? I'...
cdt123
  • 141
5 votes
1 answer
182 views

I am working with multiple data files in a folder where some files are in CSV format and others are in JSON format. I want to combine all of them into a single DataFrame for further analysis. Here is ...
Iformatio
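For the mixed CSV/JSON folder described above, a minimal sketch with pandas might look like the following. The function name `load_folder` and the added `source_file` column are hypothetical, and the sketch assumes each JSON file holds a flat array of records that `pd.read_json` can parse directly; nested JSON would need `pd.json_normalize` or similar first.

```python
from pathlib import Path

import pandas as pd


def load_folder(folder):
    """Read every .csv and .json file in `folder` into one DataFrame."""
    readers = {".csv": pd.read_csv, ".json": pd.read_json}
    frames = []
    for path in sorted(Path(folder).iterdir()):
        reader = readers.get(path.suffix.lower())
        if reader is None:
            continue  # skip files that are neither CSV nor JSON
        frame = reader(path)
        frame["source_file"] = path.name  # remember where each row came from
        frames.append(frame)
    if not frames:
        return pd.DataFrame()
    # Columns missing from some files are filled with NaN by concat.
    return pd.concat(frames, ignore_index=True, sort=False)
```

Keeping the file name in a column makes it easier to trace schema mismatches back to a specific file after concatenation.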
2 votes
0 answers
15 views

The documentation is a little confusing, so I thought I would ask here to make sure. I'm using: ...
cdt123
  • 141
5 votes
1 answer
54 views

I'm implementing this paper and trying to train the model on generated data, returning the full ground truths and a single forecast, but the forecasts my model produces are not anchored to the past series ...
cdt123
  • 141
7 votes
4 answers
336 views

If I have a binary matrix that looks something like this (only 10 rows are shown so you can see what it looks like; the full dataset has a million rows): ...
machine_learning_student
2 votes
0 answers
44 views

I am a 6th-semester student and my mini project is IR pedestrian detection using YOLOv8. My job is to train a GAN (Pix2Pix-based) to generate synthetic IR images. I will give my code below. My 99th ...
Oh na na
5 votes
0 answers
30 views

I have built and trained an NMT model using an RNN in Google Colab, and now when I try to predict on my test data, my Colab session keeps crashing. The shape of my test data is 47838×55 ...
swar_codes
6 votes
1 answer
129 views

I'm learning about classification using decision trees. I'm using the DecisionTreeClassifier class in the scikit-learn library in Python to train the model on training data (which yields a fully grown tree), ...
aweller3
2 votes
0 answers
32 views

Given a text resource (corpus, novel, ...), I want to 1) find pairs of words that co-occur with statistical significance and 2) extract contextual knowledge from these pairs. For simplicity I'm ...
MissingSemikolonAt63
5 votes
1 answer
120 views

I am working on a movie recommendation problem where I get multiple files from the source, and the total data size is around 900 MB. I am using the ...
RushHour
  • 259
5 votes
1 answer
80 views

In an organization, IT has deployed a strict proxy policy (no special rights for anyone). Windows 11 is the OS installed for everyone, with strict restrictions. To run data science tasks using Python, in ...
an ch
  • 51
1 vote
0 answers
20 views

I performed Random Forest–based feature importance analysis on the MNIST dataset, focusing only on digits 0 and 1. When I visualize the importance map (see image below), it doesn’t resemble the ...
Manish Yadav
2 votes
0 answers
27 views

I have a large set of phrases obtained via Azure Fast Transcription, and I need to group them into coherent semantic chunks (to use later in a RAG pipeline). Initially, I tried grouping phrases based ...
Daniel
  • 21
0 votes
0 answers
26 views

I have a set of fingerprints as a dataset (provided by my college). I want to use these fingerprints to train a model to understand the different things. That is beside the point....
Sayan
  • 1
