Questions tagged [python]
For questions related to AI implementation in the Python language
365 questions
0
votes
0
answers
14
views
Can I create a layer in Keras that de-scales my output?
I am working on a keras regression network that takes about 60 input variables and outputs 35 variables. For both the input and output, about half of the variables are in the range of ±10, while the ...
0
votes
0
answers
13
views
Langchain agent state preservation
I've been trying to create my first agent using Langchain. I would like some way to store user details, what questions they may be asking etc to be able to effectively govern how the LLM responds. It'...
0
votes
0
answers
7
views
How to correctly extract the CLS token from a Keras Hub ViT backbone, and clarify preprocessor usage and pretraining dataset?
I’m working with a Vision Transformer (ViT) backbone from Keras Hub and building my own classification head. My code looks like this:
python
...
1
vote
0
answers
15
views
How to generate a “clinical care plan template” from a set of historical documents?
I have a corpus of clinical-domain documents (e.g., prior reports / care plans / discharge summaries).
Given new user/patient info (e.g., suspected disease/condition, age, symptoms, comorbidities), I ...
0
votes
0
answers
51
views
Downloaded model works inconsistently
I have a piece of coe, that uses huggingface model, and it works just fine:
...
1
vote
0
answers
48
views
Can’t reproduce perplexity distributions/peak in synthetic self-training (‘model collapse’) experiment (OPT-125m)
I investigate this paper: https://www.nature.com/articles/s41586-024-07566-y
They published some code here: https://zenodo.org/records/10866595
My generated text collapses across generations, but my ...
0
votes
0
answers
96
views
Is it safe to assume that MCP server + OpenWebUI + llm an agent?
I am relatively new to MCP server and into this world of AI agents and have been trying my best to understand.
So far, in my understand it is safe to call something( LLM + tools + loop until the job ...
1
vote
0
answers
128
views
How to properly change Yolov8 yaml file and improve accuracy comparison with normal training?
I'm trying to increase the mAP value of the RTTS dataset, and for this, I've made changes to yolo8's architecture and added new layers.
With the standard yolov8n.pt ...
0
votes
1
answer
181
views
What are the best Python library to implementation neural network modification algorithms? [closed]
I want to implement in python some algorithms from a paper that allow for a pre-trained neural network to be modified (adding or removing neurons or layers) conserving (theoretically) the outputs of ...
0
votes
1
answer
184
views
Why are certain image-to-text models giving me weird responses that look like prompts that a human might give?
DISCLAIMER:
The following is a discussion of abliterated/uncensored image analysis using AI. If this at all offends you, or you are under the age of legal consent in your country, please do not read ...
0
votes
0
answers
41
views
How to make sure my AI code correctly uses Google search?
I am using the following (straightforward) code to run a prompt with some text and images to Google Gemini. The system instruction is quite extensive but clearly states to 1) use the tool, and 2) ...
1
vote
1
answer
135
views
DQN is not learning in Atari Pong environment and I can't figure out where I'm messing up
I'm trying to implement the findings from this DeepMind DQN paper (2015) from scratch in PyTorch using the Atari Pong environment.
I've tested my Deep Q-Network on a simple test environment, where ...
0
votes
0
answers
76
views
Use TensorFlow's model in OpenCV (C++)
I've trained a model to recognize some captchas and the model is doing great. I've trained it using this tutorial (actually I just stole the code, tweaked it and run :) ). Now, I'd like to export it ...
0
votes
0
answers
175
views
Model or LLM that is fast enough to describe an image in detail
The heading might be little weird, but let's get on the point.
I made an chat-bot like application where user can upload video and cant chat/ask anything about the video content, just like we talk to ...
0
votes
0
answers
34
views
How can I integrate LangSmith for observability in a multi-agent Autogen (AG2) GroupChat setup?
I'm working on a document analysis service using Autogen (AG2). The service has two main agents: a reader and an analyzer. The reader splits the document into chunks and sends them to the analyzer, ...