从Python中删除heapq需要O(logn)

4smxwvx5  于 2024-01-05  发布在  Python
关注(0)|答案(5)|浏览(148)

我有一个这样的堆(Python,heapq模块)-

  1. >>> h = []
  2. >>> heappush(h, (5, 'write code'))
  3. >>> heappush(h, (7, 'release product'))
  4. >>> heappush(h, (1, 'write spec'))
  5. >>> heappush(h, (3, 'create tests'))

字符串
我如何在O(logn)中删除项目值为“创建测试”的元组并保留堆属性?
这是我能想到的(不是O(logn))

  1. for i in range(len(h)):
  2. if h[i][1] == "create tests":
  3. h[i], h[-1] = h[-1], h[i]
  4. popped = h.pop()
  5. heapq.heapify(h)
  6. break

muk1a3rh

muk1a3rh1#

如果您确实需要从heap中取出一个项目,但又想保留heap,您可以延迟执行,并在项目自然出现时丢弃它,而不是在列表中搜索它。
如果您将要删除的项目存储在黑名单set中,则每次heapq.heappop检查该项目是否在set中。如果存在,则丢弃它,然后再次heappop,直到您获得未列入黑名单的项目,或者heap为空

alen0pnh

alen0pnh2#

如果多个被删除的元素具有相同的值,那么黑名单集将是有问题的。相反,使用tombstone-counting-dict实现heap_remove

  1. def heap_remove(heap, value):
  2. tombstones[value] = tombstones.get(value, 0) + 1
  3. while len(heap) and heap[0] in tombstones and tombstones[heap[0]]:
  4. heappop(heap)

字符串
正如预期的那样,你已经摊销了O(1)的移除时间,并且只要你不在其他地方的堆上popping,堆的top总是准确的。
考虑这个数字列表,它们首先被全部推入堆,然后以相同的顺序“移除”(而不是弹出):
[三、三、二、七、一、四、二]
按预期工作:

  1. After inserting 3 into heap, top = 3
  2. After inserting 3 into heap, top = 3
  3. After inserting 2 into heap, top = 2
  4. After inserting 7 into heap, top = 2
  5. After inserting 1 into heap, top = 1
  6. After inserting 4 into heap, top = 1
  7. After inserting 2 into heap, top = 1


但是移除是通过增加对象的tombstone来完成的。如果堆的顶部设置了tombstone,那么移除对象并递减tombstone计数器。

  1. Called remove( 3 )
  2. Marking 3 for deletion
  3. Called remove( 3 )
  4. Marking 3 for deletion
  5. Called remove( 2 )
  6. Marking 2 for deletion
  7. Called remove( 7 )
  8. Marking 7 for deletion
  9. Called remove( 1 )
  10. Marking 1 for deletion
  11. Deleting top 1
  12. Updated heap is: [2, 2, 3, 7, 3, 4]
  13. Deleting top 1
  14. Updated heap is: [2, 3, 3, 7, 4]
  15. Called remove( 4 )
  16. Marking 4 for deletion
  17. Called remove( 2 )
  18. Marking 2 for deletion
  19. Deleting top 2
  20. Updated heap is: [3, 3, 4, 7]
  21. Deleting top 3
  22. Updated heap is: [3, 7, 4]
  23. Deleting top 3
  24. Updated heap is: [4, 7]
  25. Deleting top 4
  26. Updated heap is: [7]
  27. Deleting top 7
  28. Updated heap is: []


请注意,当第二个heap_remove(3)被调用时,@GP89的解决方案会崩溃,因为3已经在tombstone集中。
您可以探索此示例here

展开查看全部
j9per5c4

j9per5c43#

有了以上两个想法,这里是一个完整的演示:我会尽快使它干净简洁。

  1. from heapq import heappush, heappop
  2. class Solution:
  3. def demo():
  4. deleted = {}
  5. h = [0]
  6. heappush(h, 789)
  7. heappush(h, 101)
  8. heappush(h, 101)
  9. self.remove(h, 101, deleted)
  10. max_val = self.peek(h, deleted)
  11. def remove(self, h, y, deleted):
  12. deleted[y] = deleted.get(y, 0) + 1
  13. while len(h) > 0 and h[0] == y and deleted[y] > 0:
  14. heappop(h)
  15. deleted[y] -= 1
  16. def peek(self, h, deleted):
  17. while len(h) > 0 and deleted.get(h[0],0) > 0:
  18. deleted[h[0]] -= 1
  19. heappop(h)
  20. return h[0]

字符串

展开查看全部
ycl3bljg

ycl3bljg4#

在这种方法中,我基本上是在字典中跟踪元素。所以,无论何时,我删除它,搜索过程都变成了O(1)。

  1. import heapq
  2. import collections
  3. import itertools
  4. class RemoveHeap:
  5. def __init__(self):
  6. self.h = []
  7. self.track = collections.defaultdict(collections.deque)
  8. self.counter = itertools.count()
  9. def insert_item(self, val):
  10. count = next(self.counter)
  11. item = [val, count, 'active']
  12. self.track[val].append(item)
  13. heapq.heappush(self.h, item)
  14. def delete_item(self, val):
  15. if val in self.track:
  16. items = self.track[val]
  17. for item in items:
  18. if item[2] == 'active':
  19. item[2] = 'deleted'
  20. break
  21. def pop_item(self):
  22. while len(self.h) > 0:
  23. item = heapq.heappop(self.h)
  24. item_track = self.track[item[0]]
  25. item_track.popleft()
  26. if len(item_track) == 0:
  27. del self.track[item[0]]
  28. else:
  29. self.track[item[0]] = item_track
  30. if item[2] == 'active':
  31. return item[0]
  32. def peek_item(self):
  33. item = self.h[0]
  34. if item[2] == 'deleted':
  35. x = self.pop_item()
  36. self.insert_item(x)
  37. return x
  38. return item[0]

字符串

展开查看全部
dwbf0jvd

dwbf0jvd5#

恐怕只有heapq没有这样的方法。因为从堆中搜索元素需要O(n)
但是,您可以将它与dict之类的东西一起使用,这将为O(1)提供搜索条目的时间。

已删除:

我尝试使用字典记账,但我如何才能得到“创建测试”的索引时,它被插入?- Prakhar 3小时前
一种简单的方法是:

  1. # remember to update this hdict when updating the heap.
  2. hdict = { h[i][1]: i for i in range(len(h)) }

字符串
然后你可以通过访问这个hdict而不是你的O(n)线性搜索来获得给定字符串的索引。

相关问题