InstaParseX is a Python-based project that parses raw Instagram profile text to extract usernames, followers, following, posts, and categories for structured analysis.
-
Updated
Jan 23, 2026 - Jupyter Notebook
InstaParseX is a Python-based project that parses raw Instagram profile text to extract usernames, followers, following, posts, and categories for structured analysis.
The Multi-Language Invoice Extractor leverages Google Gemini Pro LLM to extract key data from invoices in various languages. It uses advanced NLP and machine learning techniques to automate and optimize invoice data extraction across multiple languages.
Ollama-Profile is a lightweight RAG-based system that answers questions about me using structured personal data. It stores my information as vector embeddings in ChromaDB, routes queries to only relevant data chunks, and uses a local Phi-3 model via Ollama to generate accurate, context-grounded responses without hallucinations.
This Java application is crafted to convert unstructured data into structured JSON-LD format compliant with Schema.org standards. It features a main class that oversees the iteration of data transformation and employs the OpenAI GPT API for data structuring. Additionally, it incorporates a validation class dedicated to verifying that the structured
Add a description, image, and links to the datastructuring topic page so that developers can more easily learn about it.
To associate your repository with the datastructuring topic, visit your repo's landing page and select "manage topics."