Closed
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Cloud agent has begun work on Tell me what is goin... and will update this pull request as work progresses.
Original prompt
Tell me what is going on
Embeddings computed. Shape: (1478, 768)
Embedding this many images: 1843
Processing Images
Traceback (most recent call last):
File "/workspace/scripts/python_helpers/run_embedding_pipeline.py", line 340, in
main()
File "/workspace/scripts/python_helpers/run_embedding_pipeline.py", line 282, in main
process_pdfs(
File "/workspace/scripts/python_helpers/run_embedding_pipeline.py", line 39, in process_pdfs
pdf_to_txt_img_time, text_embed_time, img_embed_time = processor.pdfs_to_embeddings(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/govscape/govscape/pdf_to_embed.py", line 379, in pdfs_to_embeddings
self.compute_image_embeddings()
File "/workspace/govscape/govscape/pdf_to_embed.py", line 350, in compute_image_embeddings
emb = self.visual_model.encode_images(img_paths)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/govscape/govscape/visual_embedding_models.py", line 163, in encode_images
batch_tensors = pool.starmap(
^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/multiprocessing/pool.py", line 375, in starmap
return self._map_async(func, iterable, starmapstar, chunksize).get()
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/multiprocessing/pool.py", line 774, in get
raise self._value
multiprocessing.pool.MaybeEncodingError: Error sending result: '[tensor([[[[1.9303, 1.9303, 1.9303, ..., 1.9303, 1.9303, 1.9303],
[1.9303, 1.9303, 1.9303, ..., 1.9303, 1.9303, 1.9303],
[1.9303, 1.9303, 1.9303, ..., 1.9303, 1.9303, 1.9303],
...,
[1.9303, 1.9303, 1.9303, ..., 1.9303, 1.9303, 1.9303],
[1.9303, 1.9303, 1.9303, ..., 1.9303, 1.9303, 1.9303],
[1.9303, 1.9303, 1.9303, ..., 1.9303, 1.9303, 1.9303]],
Created from VS Code.
💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.