Artificial Intelligence Zone

Researchers Shanghai AI Lab and SenseTime Propose MM-Grounding-DINO: An Open and Comprehensive Pipeline for Unified Object Grounding and Detection

Marktechpost

JANUARY 16, 2024

OVD models are trained on base categories in zero-shot scenarios but must predict both base and novel categories within a broad vocabulary. PG provides a phrase to describe candidate categories and output corresponding boxes, while REC accurately identifies a target from text and outlines its position using a bounding box.

AI

AI AI ML Computer Vision

Who Is Responsible If Healthcare AI Fails?

Unite.AI

JUNE 26, 2023

Both categories have their risks. Most AI today use “black box” logic, meaning no one can see how the algorithm makes decisions. Black box AI lack transparency, leading to risks like logic bias , discrimination and inaccurate results. Explainable AI — also known as white box AI — may solve transparency and data bias concerns.

Black Box AI

Black Box AI Explainable AI Explainability AI

Meet Dawn AI: An AI Analytics Start-Up Transforming User Requests and Model Outputs into Metrics

Marktechpost

APRIL 2, 2024

Dawn aims to address the black box problem by providing an all-encompassing analytics platform tailored to AI goods. Dawn AI’s key features are as follows: Dawn is a master of categorization/tokens; it can automatically sort user inputs and model outputs into useful categories. Funding Round Dawn is backed up by Y Combinator.

Categorization

Categorization AI AI AI Modeling

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Apple Researchers Propose MAD-Bench Benchmark to Overcome Hallucinations and Deceptive Prompts in Multimodal Large Language Models

Marktechpost

FEBRUARY 29, 2024

Various categories of hallucinations in MLLMs include describing non-existent objects, misunderstanding spatial relationships, and counting objects incorrectly. The dataset includes six categories of deception: Count of Objects, Non-existent Object, Object Attribute, Scene Understanding, Spatial Relationship, and Visual Confusion.

Large Language Models

Large Language Models Prompt Engineer Prompt Engineering ML

New Study: 2018 State of Embedded Analytics Report

Why do some embedded analytics projects succeed while others fail? We surveyed 500+ application teams embedding analytics to find out which analytics features actually move the needle. Read the 6th annual State of Embedded Analytics Report to discover new best practices. Brought to you by Logi Analytics.

Artificial Intelligence

Researchers from Microsoft Research and Tsinghua University Proposed Skeleton-of-Thought (SoT): A New Artificial Intelligence Approach to Accelerate Generation of LLMs

Marktechpost

NOVEMBER 23, 2023

Unlike conventional methods, SoT refrains from making extensive changes to LLMs and treats them as black boxes instead. To evaluate the effectiveness of SoT, the research team conducted extensive tests on 12 recently released models, spanning both open-source and API-based categories.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Large Language Models LLM

TinySAM : Pushing the Boundaries for Segment Anything Model

Unite.AI

FEBRUARY 8, 2024

Owing to its exceptional performance on tasks segmenting objects with arbitrary categories and shapes, it serves as the foundation for frameworks performing downstream tasks like image inpainting, object tracking, 3D vision, and more. The model uses different networks to process the geometric and the text prompts.

Computer Vision

Computer Vision Artificial Intelligence Artificial Intelligence

Penetration testing methodologies and standards

IBM Journey to AI blog

JANUARY 24, 2024

The organization’s choice will depend on the category of the target organization, the goal of the pen test and the scope of the security test. There are a variety of tests the pen testers can do, including a black-box test, white-box test, and gray-box test. There is no one-size-fits-all approach.

Automation

Detect Anything You Want With UniDetector

Marktechpost

AUGUST 1, 2023

However, the challenge lies in the variation of object categories and scenes. The ideal large-scale learning dataset should include many image types, encompassing as many categories as possible, with high-quality bounding box annotations and extensive category vocabularies.

Deep Learning

Deep Learning ML AI Researcher AI Research

Balancing AI: Do good and avoid harm

IBM Journey to AI blog

JANUARY 25, 2024

Worker education and knowledge management are now tightly coordinated as a multi-stakeholder strategy with IT, legal, compliance and business operators as an ongoing process, as opposed to a once-a-year check box. This discrepancy exists because policies alone cannot eliminate the prevalence and increasing use of digital tools.

Responsible AI

Responsible AI AI AI Generative AI

Enhancing Vision-Language Models with Chain of Manipulations: A Leap Towards Faithful Visual Reasoning and Error Traceability

Marktechpost

FEBRUARY 16, 2024

In the initial training round, most VLMs learned a plethora of intrinsic multimodal abilities, such as grounding boxes and word recognition. boxes, messages, images) by applying a sequence of manipulations to the visual input. Models can execute evidential visual reasoning for issue-solving by mimicking basic human-like behaviors (e.g.,

Large Language Models

Large Language Models Machine Learning Automation ML

Foundational vision models and visual prompt engineering for autonomous driving applications

AWS Machine Learning Blog

NOVEMBER 15, 2023

Visual prompts can include bounding boxes or masks that guide vision models in generating relevant and accurate outputs. This extensive dataset covers a wide range of objects and categories, providing SAM with a diverse and large-scale training data source. In visual prompting, the bounding boxes, points, or masks are the input slots.

Prompt Engineer

Prompt Engineer Prompt Engineering Computer Vision Machine Learning

GDPR compliance checklist

IBM Journey to AI blog

JANUARY 22, 2024

Affirmative consent means the user must take some intentional action to show consent, such as by signing a statement or checking a box. The organization takes extra precautions when processing children’s data or special category data. Special category data includes highly sensitive data like a person’s race and biometrics.

Automation

Automation Explainability

Grounded-SAM Explained: A New Image Segmentation Paradigm?

Viso.ai

MARCH 19, 2024

It leverages DINO (Distilled Knowledge from Internet pre-trained mOdels), to interpret free-form text and generate precise bounding boxes and labels for objects within images. On top of textual description, it can also process prompts as bounding boxes or points. Vision Transformers (ViTs) form the backbone of this model. ODISE-L 38.7

Explainability

Explainability Computer Vision Machine Learning ChatGPT

SalesForce AI Researchers Introduce Mask-free OVIS: An Open-Vocabulary Instance Segmentation Mask Generator

Marktechpost

JUNE 19, 2023

However, there is a certain downside to existing detection models regarding the number of base categories they can identify. Previous trials have indicated that if a detection model is trained on the COCO dataset, its capability to detect approximately 80 categories can be attained.

AI Researcher

AI Researcher AI Research Convolutional Neural Networks Computer Vision

Enhancing Machine Learning Reliability: How Atypicality Improves Model Performance and Uncertainty Quantification

Marktechpost

DECEMBER 11, 2023

An object is considered typical if it resembles other items in its category. Several cognitive science studies imply that typicality is essential to category knowledge. Even logistic regression and neural networks might have incorrect calibrations right out of the box.

Machine Learning

Machine Learning Neural Network Categorization AI Researcher

Best Image Annotation Tools in 2024

Marktechpost

JANUARY 22, 2024

An image may be annotated in many ways, such as by drawing bounding boxes around items, titling them, or segmenting them according to their visual characteristics. Keylabs Keylabs enables users to annotate images with captions, tags, and other information, such as bounding boxes, important points, and semantic segmentation.

Computer Vision

Computer Vision Machine Learning ML Deep Learning

Create high-quality datasets with Amazon SageMaker Ground Truth and FiftyOne

AWS Machine Learning Blog

MAY 5, 2023

To create this app, they need a high-quality dataset containing clothing images, labeled with different categories. Fortunately, these images have already been cropped to the object detection bounding boxes, so we can focus on classification, rather than worry about object detection.

Metadata

Metadata Computer Vision Machine Learning Data Scientist

RO-ViT: Region-aware pre-training for open-vocabulary object detection with vision transformers

Google Research AI blog

AUGUST 28, 2023

To overcome this, the open-vocabulary detection task (OVD) has emerged, utilizing image-text pairs for training and incorporating new category names at test time by associating them with the image content. By treating categories as text embeddings, open-vocabulary detectors can predict a wide range of unseen objects. mask AP r.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network

The COCO dataset: All you need to know

Mlearning.ai

FEBRUARY 18, 2024

Object detection with COCO: Every object contained in the dataset comes with annotations comprising a bounding box and an associated class label. In the following section, we will delve into each of these problem types to foster a comprehensive understanding.

Computer Vision

Computer Vision Machine Learning Algorithm ML

Segment anything with ONNX Runtime using Azure Machine Learning

Mlearning.ai

SEPTEMBER 2, 2023

imread('images/truck.jpg') image = cv2.cvtColor(image, cvtColor(image, cv2.COLOR_BGR2RGB) COLOR_BGR2RGB) plt.figure(figsize=(10,10)) plt.imshow(image) plt.axis('on') plt.show() Use onnx model now to predict ort_session = onnxruntime.InferenceSession(onnx_model_path) Set GPU sam.to(device='cuda') cvtColor(image, cv2.COLOR_BGR2RGB)

Machine Learning

Machine Learning ML Python AI

Open Images V7 — Now Featuring Point Labels

Google Research AI blog

OCTOBER 25, 2022

Posted by Rodrigo Benenson, Research Scientist, Google Research Open Images is a computer vision dataset covering ~9 million images with labels spanning thousands of object categories. By focusing on point labels, we expanded the number of images annotated and categories covered. On average these images have annotations for 6.7

Computer Vision

Computer Vision Machine Learning ML Automation

How to implement the General Data Protection Regulation (GDPR)

IBM Journey to AI blog

FEBRUARY 23, 2024

Identify and protect special category data When inventorying data, organizations should make a note of any especially sensitive data that requires extra protection. The GDPR mandates added precautions for three kinds of data in particular: special category data, criminal conviction data, and children’s data.

Automation

Automation Explainability OpenAI ChatGPT

Inside LlaVA: The First Open Source GPT-4V Alternative

Towards AI

OCTOBER 30, 2023

These representations fall into two categories: · Captions: These serve as textual descriptions that offer diverse perspectives on the visual scene. Bounding Boxes: These handy boxes serve to pinpoint and delineate objects within the scene. Each box encodes not only the object concept but also its spatial location.

Large Language Models

Large Language Models LLM Machine Learning ChatGPT

Microsoft Researchers Propose a Novel Framework for LLM Calibration Using Pareto Optimal Self-Supervision without Using Labeled Training Data

Flipboard

JULY 3, 2023

There are two basic categories of methods for evaluating the degree of confidence in LLM replies. The second category of options turns to outside sources of data, such as hiring human reviewers to verify the answer or using huge amounts of labeled data to create assessment models.

LLM

LLM Large Language Models AI Tools NLP

Meet DreamSync: A New Artificial Intelligence Framework to Improve Text-to-Image (T2I) Synthesis with Feedback from Image Understanding Models

Marktechpost

DECEMBER 5, 2023

With 4K prompts and 25K questions, TIFA facilitates evaluation across 12 categories. Future enhancements for DreamSync include grounding feedback with detailed annotations like bounding boxes for identifying misalignments. Previous studies proposed using VQA models, exemplified by TIFA, to assess T2I generation.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence AI Researcher AI Research

The Evolution of ImageNet and Its Applications

Viso.ai

FEBRUARY 11, 2024

Over one million annotated images with bounding boxes. The first ILSVRC, a subset of ImageNet, used a set of only 1000 image categories (classes) and was able to classify 90 of the 120 dog breeds. The hierarchy is composed of nodes that define the categories. Each category is described by a synset (a set of meaningful phrases).

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network Deep Learning

Swin Transformer: A Novel Hierarchical Vision Transformer for Object Recognition

Heartbeat

JULY 6, 2023

CNNs are capable of learning features that are robust to variations in object appearance, lighting, and orientation, and can be trained to identify specific object categories such as people, cars, animals, and other objects. numpy() for box, score in zip(boxes, scores): if score > 0.8: What is the Swin Transformer?

Computer Vision

Computer Vision Convolutional Neural Networks Deep Learning Neural Network

Training YOLOv4 on Google Colab

Mlearning.ai

JUNE 18, 2023

We need to extract all individual bounding box annotations from each.xml file, reformat the bounding box, and then save the annotations to a .txt As the bounding box annotations for this particular dataset are in the Pascal VOC format, we need to modify them to the YOLO format for YOLO to ingest. xml file in annotations.

Deep Learning

Deep Learning Computer Vision Python ML

Researchers from the University of UT Austin Introduce PSLD: An AI Method that Uses Stable Diffusion to Solve All Linear Problems Without Any Extra Training

Marktechpost

JULY 16, 2023

For solving inverse problems, there are two categories of approaches: supervised techniques, where a restoration model is trained to complete the task, and unsupervised methods, where a generative model uses the prior it has learned to direct the restoration process.

Algorithm

Algorithm AI Tools AI AI

Peeking Inside Pandora’s Box: Unveiling the Hidden Complexities of Language Model Datasets with ‘What’s in My Big Data’? (WIMBD)

Marktechpost

NOVEMBER 5, 2023

They classify their analyses into four categories: Data statistics (e.g., The post Peeking Inside Pandora’s Box: Unveiling the Hidden Complexities of Language Model Datasets with ‘What’s in My Big Data’? number of tokens and domain distribution). Data quality (e.g., We are also on Telegram and WhatsApp.

Big Data

Big Data Machine Learning Data Quality AI Researcher

LLMs and Data-to-text

Ehud Reiter

JUNE 28, 2023

Most “leaderboard” data-to-text tasks, such as E2E and WebNLG, fit into this category. Specifics depend on use case and work flow, but this can be a major challenge for black-box neural systems. In some cases, developers must provide proof (eg to regulators) that this is the case.

LLM

LLM ChatGPT

The risks and limitations of AI in insurance

IBM Journey to AI blog

MAY 8, 2023

Risk and limitations of AI The risk associated with the adoption of AI in insurance can be separated broadly into two categories—technological and usage. Technological risk—transparency The black-box characteristic of AI systems, especially generative AI, renders the decision process of AI algorithms hard to understand.

AI

AI AI Algorithm Generative AI

Multivariate Time Series Forecasting

Mlearning.ai

JULY 2, 2023

The Art of Forecasting in the Retail Industry Part I : Exploratory Data Analysis & Time Series Analysis In this article, I will conduct exploratory data analysis and time series analysis using a dataset consisting of product sales in different categories from a store in the US between 2015 and 2018.

Categorization

Categorization Data Analysis Data Science ML

Data Validation in MS Excel: A Guide

Pickl AI

SEPTEMBER 22, 2023

Users can choose from a number of items with ease by using combo boxes or dropdown lists, which you can construct. For categorical data like product categories or department names, this is extremely helpful. Access the Data Validation Dialog Box: Go to the “Data” tab on the Excel ribbon.

Data Analysis

Data Analysis Data Integration Categorization Data Science

How to become an AI+ enterprise

IBM Journey to AI blog

MARCH 4, 2024

Do you use gen AI out of the box? These outcomes typically fall into one of three categories, none of which are desirable: Not useful: Customers remain unimpressed with your results. Should you build your own? If so, where will it run? How can you master prompt engineering? When should you prompt-tune or fine-tune?

AI

AI AI Artificial Intelligence Artificial Intelligence

Researchers from Princeton Introduce Infinigen: A Procedural Generator of Photorealistic 3D Scenes of the Natural World

Marktechpost

JUNE 26, 2023

Additionally, utilities have been developed to render synthetic images with ground truth labels, providing information such as depth, occlusion boundaries, bounding boxes, optical flow, surface normals, object categories, and instance segmentation.

Computer Vision

Computer Vision AI Tools Python AI Researcher

Achieving accurate image segmentation with limited data: strategies and techniques

deepsense.ai

FEBRUARY 6, 2024

Typically, we can classify segmentation tasks into four categories: Semantic Segmentation aims to associate a label with every pixel in an image. These prompts can take various forms, such as a point, bounding box, initial binary mask, or even text, indicating what specific area of the image to segment. Source: [link].

Prompt Engineer

Prompt Engineer Prompt Engineering NLP Computer Vision

Just Calm Down About GPT-4 Already

Flipboard

MAY 17, 2023

I think what we’re going to see—and I’ve seen a bunch of papers recently about boxing in large language models—is much smoother language interfaces, input and output. But you have to box things in carefully so that the craziness doesn’t come out, and the making stuff up doesn’t come out. “I Didn’t at all. It was a total flop.

Large Language Models

Large Language Models Robotics Convolutional Neural Networks Neural Network

Mapping Medical Terms to MedDRA Ontology Using Healthcare NLP

John Snow Labs

APRIL 18, 2024

MedDRA is structured hierarchically; System Organ Classes (SOCs): SOCs are general categories that represent different body systems or medical areas. HLGTs further specify the categories within a SOC. High-Level Group Terms (HLGTs): Within each SOC are HLGTs. They group similar medical conditions or diseases.

NLP

NLP BERT Categorization Automation

Semantic vs Instance Segmentation (2024 Update)

Viso.ai

JANUARY 4, 2024

Unlike simple segmentation that might just separate foreground from background, semantic segmentation categorizes all pixels in an image into predefined categories. Each pixel in the image is classified and segmented to represent distinct objects or regions based on semantic categories. There are a few different dimensions to this.

Computer Vision

Computer Vision Convolutional Neural Networks Deep Learning Neural Network

YOLO-World: Real-Time Open-Vocabulary Object Detection

Unite.AI

MARCH 15, 2024

However, these models have a fixed vocabulary, limited to detecting objects within the 80 categories of the COCO dataset. This limitation stems from the training process, where object detectors are trained to recognize only specific categories, thus limiting their applicability.

Neural Network

Neural Network Computer Vision Categorization Algorithm

How to Connect Text and Images

Becoming Human

MARCH 16, 2023

Methods like unsupervised learning also fail in scenarios where different sub-categories of the same object need to be classified — for instance, trying to identify different breeds of dogs. Even if search engines may be trained on dozens of different categories of images, individuals can still supply them with new things to look for.

Computer Vision

Computer Vision Deep Learning Machine Learning Artificial Intelligence

Azure Open AI with Power Apps

Mlearning.ai

FEBRUARY 6, 2023

The Internet’s most beloved cooking guru has a buzzy new book and a fresh new perspective: Classified category:"}) Now add another button to Parse Unstructed Data UpdateContext({openaitext:"There are many fruits that were found on the recently discovered planet Goocrux. mi) and a mass of about 1.4 solar masses.[3]

AI

AI AI OpenAI Large Language Models

Researchers Shanghai AI Lab and SenseTime Propose MM-Grounding-DINO: An Open and Comprehensive Pipeline for Unified Object Grounding and Detection

Who Is Responsible If Healthcare AI Fails?

Webinars

Trending Sources

Meet Dawn AI: An AI Analytics Start-Up Transforming User Requests and Model Outputs into Metrics

Webinars

Apple Researchers Propose MAD-Bench Benchmark to Overcome Hallucinations and Deceptive Prompts in Multimodal Large Language Models

New Study: 2018 State of Embedded Analytics Report

Researchers from Microsoft Research and Tsinghua University Proposed Skeleton-of-Thought (SoT): A New Artificial Intelligence Approach to Accelerate Generation of LLMs

TinySAM : Pushing the Boundaries for Segment Anything Model

Penetration testing methodologies and standards

Detect Anything You Want With UniDetector

Balancing AI: Do good and avoid harm

Enhancing Vision-Language Models with Chain of Manipulations: A Leap Towards Faithful Visual Reasoning and Error Traceability

Foundational vision models and visual prompt engineering for autonomous driving applications

GDPR compliance checklist

Grounded-SAM Explained: A New Image Segmentation Paradigm?

SalesForce AI Researchers Introduce Mask-free OVIS: An Open-Vocabulary Instance Segmentation Mask Generator

Enhancing Machine Learning Reliability: How Atypicality Improves Model Performance and Uncertainty Quantification

Best Image Annotation Tools in 2024

Create high-quality datasets with Amazon SageMaker Ground Truth and FiftyOne

RO-ViT: Region-aware pre-training for open-vocabulary object detection with vision transformers

The COCO dataset: All you need to know

Segment anything with ONNX Runtime using Azure Machine Learning

Open Images V7 — Now Featuring Point Labels

How to implement the General Data Protection Regulation (GDPR)

Inside LlaVA: The First Open Source GPT-4V Alternative

Microsoft Researchers Propose a Novel Framework for LLM Calibration Using Pareto Optimal Self-Supervision without Using Labeled Training Data

Meet DreamSync: A New Artificial Intelligence Framework to Improve Text-to-Image (T2I) Synthesis with Feedback from Image Understanding Models

The Evolution of ImageNet and Its Applications

Swin Transformer: A Novel Hierarchical Vision Transformer for Object Recognition

Training YOLOv4 on Google Colab

Researchers from the University of UT Austin Introduce PSLD: An AI Method that Uses Stable Diffusion to Solve All Linear Problems Without Any Extra Training

Peeking Inside Pandora’s Box: Unveiling the Hidden Complexities of Language Model Datasets with ‘What’s in My Big Data’? (WIMBD)

LLMs and Data-to-text

The risks and limitations of AI in insurance

Multivariate Time Series Forecasting

Data Validation in MS Excel: A Guide

How to become an AI+ enterprise

Researchers from Princeton Introduce Infinigen: A Procedural Generator of Photorealistic 3D Scenes of the Natural World

Achieving accurate image segmentation with limited data: strategies and techniques

Just Calm Down About GPT-4 Already

Mapping Medical Terms to MedDRA Ontology Using Healthcare NLP

Semantic vs Instance Segmentation (2024 Update)

YOLO-World: Real-Time Open-Vocabulary Object Detection

How to Connect Text and Images

Azure Open AI with Power Apps

Stay Connected