Question

Catarina Nogueira

Asked: 2024-04-19 21:55:33 +0800 CST

Maximum number of elements in a list whose sum is at most K

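I have an exercise that asks:

Consider a vector T with n positive integers and an integer k. Propose an algorithm that selects the maximum number of elements from T such that the sum of the selected elements is less than or equal to k.

What I did was: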

def findSumCount(L, k):
  L.sort()
  totalSum = L[0]
  count = 1
  for i in range(1, len(L), 1):
    if(totalSum + L[i] <= k):
      count = count + 1
      totalSum = totalSum + L[i]
    if(totalSum + L[i] > k): break

  return count

findSumCount([1, 2, 3, 4], 5)

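I think this works when I sort the array on entry (if you think it does not work, please give me a counterexample). But if I do not sort beforehand (that is, if I iterate over the array without sorting it first), I cannot see a way forward. Any ideas?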

3 Answers

Tomas Tillmann · Answer 1 · 2024-04-19T22:12:20+08:00

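Getting as many elements as possible with a sum of at most k means you always want to pick the elements with the lowest values. So sorting the input array and then taking elements in ascending order until the limit k would be exceeded is the best you can do. The complexity is O(n log n + n) ~ O(n log n), where n is the size of T. That is the price you pay for sorting :).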
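
One way to picture the sort-then-take-smallest idea is with prefix sums of the sorted values: the answer is the number of prefix sums that do not exceed k. The following is only a sketch under the problem's assumption that all values are positive integers; the name max_elements_within is made up for this illustration and does not appear in the question.

```python
from bisect import bisect_right
from itertools import accumulate


def max_elements_within(values, k):
    """Count how many of the smallest values fit within the budget k.

    Hypothetical helper for illustration; assumes positive integers,
    so the prefix sums of the sorted values are increasing.
    """
    prefix_sums = list(accumulate(sorted(values)))  # sorted() leaves `values` untouched
    # Number of prefix sums that are <= k
    return bisect_right(prefix_sums, k)


print(max_elements_within([4, 2, 1, 3], 5))  # 2, since 1 + 2 <= 5 but 1 + 2 + 3 > 5
```

The sort still dominates the cost, so this is O(n log n) as well; the binary search only replaces the final linear scan.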

trincot · Answer 2 · 2024-04-19T22:22:49+08:00

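If you don't want to mutate the input list by sorting it, you can always get a sorted list that is distinct from the input list. To do so, just replace the initial statement with L = sorted(L). Now L refers to a list that is distinct from the original one.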
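
The difference between the two ways of sorting can be seen in a few lines. This is a small illustration only; the variable names are made up for the example.

```python
original = [4, 2, 1, 3]

sorted_copy = sorted(original)   # builds and returns a new list
print(original)                  # [4, 2, 1, 3], unchanged
print(sorted_copy)               # [1, 2, 3, 4]

in_place = list(original)        # separate copy, used to show the in-place variant
result = in_place.sort()         # sorts the list itself and returns None
print(in_place, result)          # [1, 2, 3, 4] None
```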

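If k is relatively small, meaning that the number of elements to be summed is small relative to the size of the list, then it may be more efficient not to sort the list but to build a heap from it instead, and then extract the minimum values from it until you have the count: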

from heapq import heapify, heappop

def findSumCount(L, k):
    L = L[:] # optional in case you don't want to mutate the original L
    heapify(L)  # turn it into a heap
    totalSum = 0
    n = len(L)
    for count in range(n):
        totalSum += heappop(L)
        if totalSum > k:
            return count
    return n

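Copying the list (optional) and heapifying it both take O(𝑛) time. Each pop from the heap takes O(log𝑛) time, where 𝑛 is the size of the heap at that moment. So if only 𝑚 elements have to be popped from the heap, the time complexity is O(𝑛 + 𝑚log𝑛).

In the worst case, where all elements have to be popped from the list, the time complexity is O(𝑛log𝑛). In that case it has the same time complexity as the original code you provided, but with more overhead, so it will be slower in practice.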
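
To make the role of 𝑚 concrete, here is a sketch of the same heap approach that also reports how many pops were performed. The function name find_sum_count_with_pops and the extra return value are additions for this illustration, not part of the code above.

```python
import random
from heapq import heapify, heappop


def find_sum_count_with_pops(values, k):
    """Return (count, pops): the answer and the number of heappop calls made."""
    heap = list(values)   # copy so the caller's list is not mutated
    heapify(heap)         # O(n)
    total = 0
    pops = 0
    for count in range(len(heap)):
        total += heappop(heap)   # O(log n) per pop
        pops += 1
        if total > k:
            return count, pops
    return len(values), pops


values = list(range(1, 1001))
random.shuffle(values)
# 1 + 2 + 3 + 4 = 10 fits; the fifth pop overshoots, so only 5 of 1000 elements are popped
print(find_sum_count_with_pops(values, 10))  # (4, 5)
```

With a small k like this, almost all of the work is the O(𝑛) heapify step; a full sort would have ordered all 1000 values to answer the same question.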

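Remarks on your code

Your implementation has a few issues:

If all elements in the list are greater than k, it does not return the correct answer: in that case the returned count should be 0.
The expression if(totalSum + L[i] > k) is also evaluated when the preceding if block has already executed, in which case L[i] has already been added to totalSum: this is a bit odd. It does not lead to wrong results because the list is sorted, but it would be better to turn the second if into an else, or even better, to invert the condition of the first if and return the count there. That at least saves you some additions.
As count really corresponds to the index of the currently iterated value in the sorted list, you can make it the loop variable. You can get the value at the same time with enumerate.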
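
Since the question asks for a counterexample, the first remark can be checked directly. The function below is the question's code reproduced with only the name changed (find_sum_count_original is a label chosen for this illustration):

```python
def find_sum_count_original(L, k):
    # The question's code, unchanged apart from the function name
    L.sort()
    totalSum = L[0]   # the smallest element is counted unconditionally
    count = 1
    for i in range(1, len(L), 1):
        if(totalSum + L[i] <= k):
            count = count + 1
            totalSum = totalSum + L[i]
        if(totalSum + L[i] > k): break

    return count


# No element fits within k = 3, so the expected answer is 0, yet the function returns 1
print(find_sum_count_original([5, 6], 3))  # 1
```

Because the code reads L[0] before checking anything, an empty list would also raise an IndexError.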

Taking these remarks into account, it could be:

def findSumCount(L, k):
    L.sort()
    totalSum = 0
    for count, val in enumerate(L):
        totalSum += val
        if totalSum > k:
            return count
    return len(L)

wLui155 · Answer 3 · 2024-04-19T23:29:37+08:00

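As stated, it is entirely possible to solve this problem without sorting, and even without much auxiliary space, by incorporating binary search. The approach relies on the following property (which has already been identified):

It is better to put smaller elements into the selection before larger ones, since smaller elements contribute less to the sum and we want to pick as many elements as possible while keeping the sum of the selected elements at or below k.

Twisting this idea, we can maximize the number of selected elements by finding the largest value, limit, such that we select all elements less than or equal to limit. One way to find limit is to test every value in the array, but we can do better by using binary search to test only certain candidate values for limit, because we only want to maximize limit and can quickly discard candidates that are too small or too large.

The full code is below; findSumCountBsearch implements this idea and calls helper to evaluate whether a candidate limit works as proposed. There is also an edge case where all elements smaller than limit can safely be included but only a subset of the instances equal to limit can be selected; this can be handled with some math.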

import random
from math import ceil

BAD, OK, DONE = 0, 1, 2
def findSumCountBsearch(L, k):
    """
    Determine the largest number of elements in L that can be selected
    s.t the sum of the selected elements is smaller than or equal to k
    
    Does so via binary search on the biggest element we can pick; works
    bc if we want to include value x into the sum, we have to allow
    all values smaller than x into it anyways
    """
    lo, hi = min(L), max(L) # binary searching on min/max; array doesn't need to be sorted!
    res_limit, res_cnt = 0, 0
    while lo < hi:
        mid = lo + (hi - lo) // 2
        status, cnt = helper(L, k, mid)
        if status == BAD: # search left
            hi = mid
        else:
            res_cnt = cnt
            res_limit = mid
            if status == DONE:
                return res_cnt
            
            lo = mid + 1
            
    if lo <= res_limit: # can't do any better
        return res_cnt
        
    status, cnt = helper(L, k, lo)
    if status != BAD:
        res_cnt = cnt
        
    return res_cnt

def helper(L, k, limit):
    """
    Can we pick all elements in L that are smaller than or equal to limit
    where the sum is smaller than or equal to k?
    
    Also need to handle the case where we pick the largest number <=
    limit but specifically, a few of them
    """
    res, curr_sum, curr_max, max_cnt = 0, 0, None, 0
    for x in L:
        if x <= limit:
            res += 1
            curr_sum += x
            if curr_max is None or x > curr_max:
                curr_max = x
                max_cnt = 1
            elif x == curr_max:
                max_cnt += 1
            
    if curr_sum <= k:
        return (OK, res)
        
    # curr_sum above k so see if we can remove some of the largest elements
    max_remove = max_cnt * curr_max
    if curr_sum - max_remove > k: # still too big
        return (BAD, -1)
        
    # want p, the number of times we need to remove the largest element,
    # to satisfy curr_sum - p * curr_max <= k and p is as small as possible
    # => (curr_sum - k) / (curr_max) <= p
    p = ceil((curr_sum - k) / curr_max)
    return (DONE, res - p)
    
def findSumCount(L, k):
    """
    Your solution but modified a little
    """
    L = sorted(L) # avoid mutation
    if L[0] > k: # handle edge case
        return 0
      
    totalSum = L[0]
    count = 1
    for i in range(1, len(L), 1):
        if(totalSum + L[i] <= k):
            count = count + 1
            totalSum = totalSum + L[i]
        if(totalSum + L[i] > k): break
    
    return count
  
def generate_test():
    L = [random.randint(-10, 1000) for _ in range(random.randint(5, 100))]
    k = random.randint(min(L), max(L) * random.randint(1, len(L) // 2))
    return (L, k)
    
if __name__ == "__main__":
    for L, k in [
        ([1,2,3,4], 5),
        ([4,2,1,3], 5), # some permutation of the above
    ]:
        print(f"{L =}, {k =}, result: {findSumCountBsearch(L, k) = }")
        assert findSumCountBsearch(L, k) == findSumCount(L, k), f"Mismatch for {(L, k)}"
    
    for L, k in [generate_test() for _ in range(1000)]: # some bigger tests to catch more cases
        act, exp = findSumCountBsearch(L, k), findSumCount(L, k)
        assert act == exp, f"Mismatch for {(L, k)}; {act} vs {exp}"
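
The edge case mentioned above (only some copies of the largest allowed value fit) can be followed by hand on a small input. This sketch repeats the arithmetic from helper in isolation; the function name copies_to_drop and the sample values are chosen for this illustration, and it assumes, as helper checks first, that dropping every copy of the largest value would bring the sum to k or below.

```python
from math import ceil


def copies_to_drop(values, k, limit):
    """Return (p, count): how many copies of the largest value <= limit
    must be dropped so the sum fits within k, and how many elements remain.

    Hypothetical helper for illustration; assumes positive integers and
    that the sum of all values <= limit is greater than k.
    """
    picked = [x for x in values if x <= limit]
    curr_sum = sum(picked)
    curr_max = max(picked)
    # Smallest p with curr_sum - p * curr_max <= k
    p = ceil((curr_sum - k) / curr_max)
    return p, len(picked) - p


# All values <= 3 sum to 10, which is above k = 8.
# Dropping p = ceil((10 - 8) / 3) = 1 copy of 3 leaves 1 + 3 + 3 = 7 <= 8.
print(copies_to_drop([1, 3, 3, 3], 8, 3))  # (1, 3)
```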
