Computer Vision (1-10 of 21 results)

Festival celebration

Festival celebration videos

2K
VIDEOS

Weather, calamities, calamities aftermath Videos

Videos of weather events, calamities, and their aftermath

5K
VIDEOS

FileMarket | Text Recognition Data | 50,000 Images | Computer Vision Data | AI Model Training Data | Textual data | Annotated Imagery Data

Pre-collected OCR datasets include images of natural scenes, handwritten texts, bills and documents, and test papers. The AI training data spans 20 languages, various natural environments, and diverse photographic angles.

Annotated Imagery Data: FileMarket provides a robust Annotated Imagery Data set designed to meet the diverse needs of various computer vision and machine learning tasks. This dataset is part of our extensive offerings, which also include Textual Data, Object Detection Data, Large Language Model (LLM) Data, and Deep Learning (DL) Data. Each category is meticulously crafted to ensure high-quality and comprehensive datasets that empower AI development.

Specifications:
- Data Size: 50,000 images
- Collection Environment: The images cover a wide array of real-world scenarios, including shop signs, stop boards, posters, tickets, road signs, comics, cover pictures, prompts/reminders, warnings, packaging instructions, menus, building signs, and more.
- Diversity: The dataset spans 5 languages and includes images from various natural scenes captured at multiple photographic angles (looking up, looking down, eye-level).
- Devices Used: Images are captured using cellphones and cameras, reflecting real-world usage.
- Image Parameters: All images are provided in .jpg format, and the corresponding annotation files are in .json format.
- Annotation Details: The dataset includes line-level quadrilateral bounding box annotations and text transcriptions.
- Accuracy: The error margin for each vertex of the quadrilateral bounding box is within 5 pixels, ensuring bounding box accuracy of at least 97%. The text transcription accuracy also meets or exceeds 97%.

Unique Data Collection Method: FileMarket utilizes a community-driven approach to collect data, leveraging our extensive network of over 700k users across various Telegram apps. This method ensures that our datasets are diverse, real-world applicable, and ethically sourced, with full participant consent. This approach allows us to provide datasets that are both comprehensive and reflective of real-world scenarios, ensuring that your AI models are trained on the most relevant and diverse data available. By integrating our unique data collection method with the specialized categories we offer, FileMarket is committed to providing high-quality data solutions that support and enhance your AI and machine learning projects.

50K
IMAGES
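
The text recognition listing above quotes a per-vertex error margin of 5 pixels for its line-level quadrilateral annotations. A minimal sketch of how a buyer might spot-check that tolerance against their own reference labels is shown below. The JSON layout (`lines`, `text`, `quad`), the file name, and the reference quadrilateral are all hypothetical, since the listing does not document the actual annotation schema.

```python
import json
import math

# Tolerance quoted in the listing: each vertex within 5 pixels.
MAX_VERTEX_ERROR_PX = 5.0

# Hypothetical annotation record; the real .json schema is not documented in the listing.
SAMPLE_ANNOTATION = json.loads("""
{
  "image": "shop_sign_0001.jpg",
  "lines": [
    {"text": "OPEN 24 HOURS",
     "quad": [[10, 10], [110, 12], [109, 48], [9, 46]]}
  ]
}
""")


def vertex_errors(annotated, reference):
    """Return the Euclidean distance in pixels between matching vertices."""
    if len(annotated) != 4 or len(reference) != 4:
        raise ValueError("a quadrilateral needs exactly four vertices")
    return [math.dist(a, r) for a, r in zip(annotated, reference)]


def within_tolerance(annotated, reference, tol=MAX_VERTEX_ERROR_PX):
    """True when every vertex lies within `tol` pixels of its reference vertex."""
    return all(err <= tol for err in vertex_errors(annotated, reference))


# Hypothetical reference label produced by the buyer's own review.
reference_quad = [[12, 11], [108, 12], [110, 50], [10, 45]]
annotated_quad = SAMPLE_ANNOTATION["lines"][0]["quad"]
print(within_tolerance(annotated_quad, reference_quad))
```

The check assumes both quadrilaterals list their vertices in the same order; a real pipeline would need to normalize vertex order before comparing.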

FileMarket | Diverse Human Face Data | 20,000 IDs | Face Recognition Data | Image/Video AI Training Data | Biometric Data

Our pre-compiled biometric data set (human faces) includes comprehensive features such as 3D depth, segmentation of facial organs and accessories, key points, facial expressions, alpha matte, and a range of ages. All biometric data is gathered with signed authorization agreements.

Biometric Data: FileMarket provides a comprehensive Biometric Data set, ideal for enhancing AI applications in security, identity verification, and more. In addition to Biometric Data, we offer specialized datasets across Object Detection Data, Machine Learning (ML) Data, Large Language Model (LLM) Data, and Deep Learning (DL) Data. Each dataset is meticulously crafted to support the development of cutting-edge AI models.

- Data Size: 20,000 IDs
- Race Distribution: The dataset encompasses individuals from diverse racial backgrounds, including Black, Caucasian, Indian, and Asian groups.
- Gender Distribution: The dataset equally represents all genders, ensuring a balanced and inclusive collection.
- Age Distribution: The data spans a broad age range, including young, middle-aged, and senior individuals, providing comprehensive age coverage.
- Collection Environment: Data has been gathered in both indoor and outdoor environments, ensuring variety and relevance for real-world applications.
- Data Diversity: This dataset includes a rich variety of face poses, racial backgrounds, age groups, lighting conditions, and scenes, making it ideal for robust biometric model training.
- Device: All data has been collected using mobile phones, reflecting common real-world usage scenarios.
- Data Format: The data is provided in .jpg and .png formats, ensuring compatibility with various processing tools and systems.
- Accuracy: The labels for face pose, race, gender, and age are highly accurate, exceeding 95%, making this dataset reliable for training high-performance biometric models.

20K
VIDEOS

Vehicle traffic

Vehicle traffic videos

8K
VIDEOS

Factori Global Mobility Data

High-quality mobility data aggregated from multiple location-aware mobile apps and SDKs globally. This dataset provides comprehensive insights into movement patterns with daily updates. All data is collected with explicit user consent and anonymized following privacy standards.

Key features:
- 90 billion location records globally
- Daily data collection and delivery
- Complete device movement data, including coordinates, timestamps, and accuracy metrics
- Geographic data, including country, state, city, and postal codes
- Detailed device information, including carriers and user agents
- Advanced location encoding via geohash, hex8, and hex9 systems

Perfect for consumer insight analysis, market intelligence, targeted advertising, and retail analytics applications. Data is available in daily/weekly/monthly/quarterly delivery options.

90000M
TEXTS
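
The mobility listing above names geohash as one of its location encodings. As a rough illustration of what such a field contains, here is a minimal sketch of the standard public geohash algorithm, which interleaves longitude and latitude bits and emits one base-32 character per five bits. It is not Factori's implementation, the function name is hypothetical, and the hex8 and hex9 encodings mentioned in the listing are not covered.

```python
# Geohash base-32 alphabet (digits plus letters, omitting a, i, l, o).
_BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"


def geohash_encode(lat, lon, precision=9):
    """Encode a latitude/longitude pair as a geohash string of `precision` characters."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars = []
    bits = 0          # accumulates the current 5-bit group
    bit_count = 0
    use_lon = True    # bits alternate, starting with longitude
    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits = (bits << 1) | 1
                lon_lo = mid
            else:
                bits <<= 1
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits = (bits << 1) | 1
                lat_lo = mid
            else:
                bits <<= 1
                lat_hi = mid
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:
            chars.append(_BASE32[bits])
            bits = 0
            bit_count = 0
    return "".join(chars)


# Nearby points share a common prefix, which is what makes geohashes
# convenient for grouping location records by area.
print(geohash_encode(57.64911, 10.40744, precision=5))
```

Shorter geohashes describe larger cells, so truncating the string is a cheap way to aggregate records at a coarser geographic level.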

Factori Global Visit Data

Factori Visit Data connects people's movements to over 200 million physical locations globally, powering geographical information system (GIS) tools and providing data-driven insights across multiple industries. These aggregated and anonymized data points offer valuable context for the volume and patterns of visits to locations worldwide.

Key features:
- POI/Place/OOH level insights based on Factori's Mobility & People Graph data
- Foot-traffic attribution using combined location attributes
- Time-based analysis by day of week and part of day
- Home and work location distribution of visitors
- Visitor country origin and travel patterns
- Visitor demographic breakdowns including age and gender
- Device brand, model, and carrier information
- Place category and brand affinity metrics
- Geo-behavioral interest mapping

Perfect for credit scoring applications, retail analytics, market intelligence, and urban planning. Financial services can validate locations for alternative credit scoring, retailers can analyze footfall trends, marketers can study competitive landscapes, and urban planners can build cases for development based on fresh population data. Data is collected dynamically and provided through flexible delivery schedules.

200M
TEXTS

FileMarket | 20,000 Photos of Palms | AI Training Data | Large Language Model (LLM) Data | Machine Learning (ML) Data | Deep Learning (DL) Data

Overview: FileMarket's dataset provides 20,000 high-resolution images of palms, captured in a controlled environment to ensure consistent lighting and clarity. The dataset features a variety of palm types, from different angles and lighting conditions, making it an ideal resource for training AI models in areas such as object detection, plant recognition, and environmental applications.

What Makes This Data Unique? This dataset is distinctive for its comprehensive and diverse representation of palms. The images were carefully captured by professional photographers in a studio setting, ensuring uniformity in quality and lighting. The wide range of palm types, along with various angles and poses, allows for nuanced model training, including distinguishing between species, leaf shapes, and growth patterns. The consistency of the imagery eliminates the need for excessive preprocessing, enabling quicker integration into machine learning and deep learning workflows.

Data Sourcing: The palm images were sourced through professional shoots in a studio environment, guaranteeing consistency across the dataset. Each image is shot with optimal lighting and framing to enhance visual clarity. The photographers have experience in nature and botanical photography, ensuring that each photo is of exceptional quality and is suited for scientific and technical applications.

Primary Use-Cases: This dataset can be leveraged in a wide array of AI and machine learning contexts, including:
- Object Detection Data: The high clarity and consistent imagery make it perfect for training models that focus on detecting palm trees, their leaves, and different types of foliage.
- Machine Learning (ML) Data: The diversity of palm species and the variety of captured angles provide a robust dataset for training models aimed at plant identification, classification, and recognition.
- Deep Learning (DL) Data: The multi-angle shots of palms are ideal for deep learning applications that require complex features, such as image segmentation, object tracking, and even 3D reconstruction of plant structures.
- Environmental AI Applications: With detailed imagery, this dataset is suited for models used in environmental analysis, where palm trees play a role in ecosystem recognition or climate change studies.

Broader Data Offering: This dataset is a valuable addition to FileMarket’s extensive data offerings. It can be easily integrated with other datasets, such as those related to geography, climate, or biodiversity, creating more holistic AI models. Whether you are developing applications for botany research, environmental monitoring, or advanced plant recognition, this dataset is a foundational asset for AI training.

20K
IMAGES

Factori Global Web Data

Factori Web Data contains fresh web browsing data of users across desktop and mobile devices, indicating search intent, purchase intent, and online category interests. This comprehensive dataset tracks user activity across popular websites worldwide, delivered as a daily feed via server-to-server transfer.

Key features:
- Over 2 billion records of web browsing activity
- Daily data collection with daily delivery frequency
- Six months of historical data accessible
- Anonymous user identification across devices
- IP address data for geographic segmentation
- Search query capture for intent analysis
- Website category classification
- Cross-device browsing behavior patterns
- Interest and intent indicators from browsing activity

Perfect for personalized targeting applications, data enrichment projects, market intelligence analysis, and fraud detection/cybersecurity initiatives. This dataset allows organizations to analyze web behavior patterns and build highly accurate audience segments based on web activity for targeting ads based on interest categories and search/browsing intent. Data is collected dynamically and provided through suitable delivery methods on daily, weekly, or monthly intervals.

2000M
TEXTS

Factori Global High-Fidelity Mobility Data

Factori High Fidelity Mobility data is collected from location-aware partner mobile apps. This dataset includes advanced attributes such as vertical accuracy and altitude measurements to provide exceptionally accurate location intelligence.

Key features:
- Over 200 billion high-precision location records
- Daily data collection with same-day delivery frequency
- Enhanced vertical accuracy for multi-level location determination
- Altitude measurements for 3D positioning
- Comprehensive user demographic information
- Anonymous ID linking for privacy protection
- Detailed device information
- Affluence indicators and economic attributes
- Interest categorization and behavioral segments
- Travel patterns including visited countries

This dataset is used in a wide range of business applications, such as consumer insights applications, targeted advertising campaigns, and retail analytics. All data is collected with explicit opt-in consent and anonymized to ensure privacy compliance. Data is delivered daily with options for custom delivery schedules.

200000M
TEXTS