Loading…
Mitigation of Accuracy Degradation in 3D Flash Memory Based Approximate Nearest Neighbor Search with Binary Tree Balanced Soft Clustering for Retrieval-Augmented AI
Computing-in-memory (CIM) can suppress the energy consumed in transferring data from the memory to the processor. However, implementation of CIMs is often impractical due to the overhead of adding circuits on memory to perform high-precision processing. In this paper, we propose techniques to realiz...
Saved in:
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Computing-in-memory (CIM) can suppress the energy consumed in transferring data from the memory to the processor. However, implementation of CIMs is often impractical due to the overhead of adding circuits on memory to perform high-precision processing. In this paper, we propose techniques to realize CIM using a standard 3D flash memory for approximate nearest neighbor search (ANNS), and demonstrate their application to retrieval-augmented AI. To maintain high computational efficiency, we introduce binary tree balanced soft clustering, which divides the dataset into the page size while avoiding accuracy degradation. Furthermore, we introduce similarity distribution learning, which focuses on the tail of the distribution to prevent accuracy degradation in data transformation for CIM. We conducted ImageNet-1k image classification experiments as an evaluation of retrieval-augmented AI including ANNS, and achieved top-1 accuracy of 77.7%, which is comparable to digital processing. |
---|---|
ISSN: | 2474-9672 |
DOI: | 10.1109/NewCAS58973.2024.10666332 |