面试精选100题

数组、字符串

1.数组列表中的最大距离（中等）

给定 m 个数组，每个数组都已经按照升序排好序了。

现在你需要从两个不同的数组中选择两个整数（每个数组选一个）并且计算它们的距离。两个整数 a 和 b 之间的距离定义为它们差的绝对值 |a-b| 。

返回最大距离。

示例 1：

输入：[[1,2,3],[4,5],[1,2,3]]
输出：4
解释：
一种得到答案 4 的方法是从第一个数组或者第三个数组中选择 1，同时从第二个数组中选择 5 。

示例 2：

1 2	输入：arrays = [[1],[1]] 输出：0

提示：

m == arrays.length
2 <= m <= 105
1 <= arrays[i].length <= 500
-104 <= arrays[i][j] <= 104
arrays[i] 以升序排序。
所有数组中最多有 105 个整数。

解答：

class Solution:
    def maxDistance(self, arrays: List[List[int]]) -> int:
        res = 0
        n = len(arrays[0])
        min_val = arrays[0][0]
        max_val = arrays[0][-1]
        
        for i in range(1, len(arrays)):
            n = len(arrays[i])
            res = max(res, max(abs(arrays[i][n - 1] - min_val), 
                            abs(max_val - arrays[i][0])))
            min_val = min(min_val, arrays[i][0])
            max_val = max(max_val, arrays[i][-1])
        
        return res

2.摆动排序（中等）

给你一个的整数数组 nums, 将该数组重新排序后使 nums[0] <= nums[1] >= nums[2] <= nums[3]...

输入数组总是有一个有效的答案。

示例 1:

1
2
3

输入：nums = [3,5,2,1,6,4]
输出：[3,5,1,6,2,4]
解释：[1,6,2,5,3,4]也是有效的答案

示例 2:

1 2	输入：nums = [6,6,5,6,3,8] 输出：[6,6,5,6,3,8]

提示：

1 <= nums.length <= 5 * 104
0 <= nums[i] <= 104
输入的 nums 保证至少有一个答案。

进阶：你能在 O(n) 时间复杂度下解决这个问题吗？

解答：

class Solution(object):
    def wiggleSort(self, nums):
        """
        :type nums: List[int]
        :rtype: None Do not return anything, modify nums in-place instead.
        """
        nums.sort()
        temp = 0
        for i in range(1, len(nums)-1,2):
            # nums[i+1], nums[i] = nums[i], nums[i+1]
                temp = nums[i]
                nums[i] = nums[i+1]
                nums[i+1] = temp
        return nums 
# 时间复杂度O(n)
	def wiggleSort(self, nums):
        for i in range(len(nums)-1):
            if ((i%2 == 0 and nums[i] > nums[i+1]) or (i%2 == 1 and nums[i] < nums[i+1])):
                nums[i], nums[i+1] = nums[i+1], nums[i]

3.反转数字（简单）

给定一个数字 N，当它满足以下条件的时候返回 true：

原数字旋转 180° 以后可以得到新的数字。

如 0, 1, 6, 8, 9 旋转 180° 以后，得到了新的数字 0, 1, 9, 8, 6 。

2, 3, 4, 5, 7 旋转 180° 后，得到的不是数字。

易混淆数 (confusing number) 在旋转180°以后，可以得到和原来不同的数，且新数字的每一位都是有效的。

示例 1：

输入：6
输出：true
解释： 
把 6 旋转 180° 以后得到 9，9 是有效数字且 9!=6 。

示例 2：

输入：89
输出：true
解释: 
把 89 旋转 180° 以后得到 68，68 是有效数字且 89!=68 。

示例 3：

输入：11
输出：false
解释：
把 11 旋转 180° 以后得到 11，11 是有效数字但是值保持不变，所以 11 不是易混淆数字。

示例 4：

输入：25
输出：false
解释：
把 25 旋转 180° 以后得到的不是数字。

提示：

0 <= N <= 10^9
可以忽略掉旋转后得到的前导零，例如，如果我们旋转后得到 0008 那么该数字就是 8 。

题解

class Solution(object):
    def confusingNumber(self, n):
        """
        :type n: int
        :rtype: bool
        """
        dic = {'0':'0','1':'1','8':'8','6':'9','9':'6'}
        s = str(n)
        rev_s = ""
        for i in s:
            if i not in dic:
                return False
            rev_s = dic[i] + rev_s
        return rev_s!=s

4.字符串左右移

给定一个包含小写英文字母的字符串 s 以及一个矩阵 shift，其中 shift[i] = [direction, amount]：

direction 可以为 0 （表示左移）或 1 （表示右移）。
amount 表示 s 左右移的位数。
左移 1 位表示移除 s 的第一个字符，并将该字符插入到 s 的结尾。
类似地，右移 1 位表示移除 s 的最后一个字符，并将该字符插入到 s 的开头。

对这个字符串进行所有操作后，返回最终结果。

示例 1：

输入：s = "abc", shift = [[0,1],[1,2]]
输出："cab"
解释：
[0,1] 表示左移 1 位。 "abc" -> "bca"
[1,2] 表示右移 2 位。 "bca" -> "cab"

示例 2：

输入：s = "abcdefg", shift = [[1,1],[1,1],[0,2],[1,3]]
输出："efgabcd"
解释： 
[1,1] 表示右移 1 位。 "abcdefg" -> "gabcdef"
[1,1] 表示右移 1 位。 "gabcdef" -> "fgabcde"
[0,2] 表示左移 2 位。 "fgabcde" -> "abcdefg"
[1,3] 表示右移 3 位。 "abcdefg" -> "efgabcd"

解答：

class Solution(object):
    def stringShift(self, s, shift):
        """
        :type s: str
        :type shift: List[List[int]]
        :rtype: str
        """
        left_sum = 0
        right_sum = 0
        for direction, amount in shift:
            if direction == 0:
                left_sum += amount
            else:
                right_sum += amount
        total = (right_sum-left_sum)%len(s)
        if total < 0:
            total += len(s)
        # 切片操作
        return s[-total:] + s[:-total]

这行代码根据计算得到的 total 值对字符串 s 进行切片和拼接操作。

s[-total:]：获取字符串 s 从倒数第 total 个字符开始到末尾的子串。
s[:-total]：获取字符串 s 从开头到倒数第 total 个字符之前的子串。
s[-total:] + s[:-total]：将上述两个子串拼接起来，得到最终移动后的字符串，并返回。

5.相隔为1的编辑距离（中等）

给定两个字符串 s 和 t ，如果它们的编辑距离为 1 ，则返回 true ，否则返回 false 。

字符串 s 和字符串 t 之间满足编辑距离等于 1 有三种可能的情形：

往 s 中插入 恰好一个 字符得到 t
从 s 中删除 恰好一个 字符得到 t
在 s 中用 一个不同的字符 替换 恰好一个 字符得到 t

示例 1：

1
2
3

输入: s = "ab", t = "acb"
输出: true
解释: 可以将 'c' 插入字符串 s 来得到 t。

示例 2:

1
2
3

输入: s = "cab", t = "ad"
输出: false
解释: 无法通过 1 步操作使 s 变为 t。

提示:

0 <= s.length, t.length <= 104
s 和 t 由小写字母，大写字母和数字组成

解答：

class Solution(object):
    def isOneEditDistance(self, s, t):
        """
        :type s: str
        :type t: str
        :rtype: bool
        """
        len_s,len_t = len(s),len(t)
        # 保证s长度要小于t   
        if len_s > len_t:
            return isOneEditDistance(t,s)
        # 长度超过1的字符距离一定不满足题意
        if len_t - len_s > 1:
            return False
        # 找s和t的前len(s)项是否相同
        for i in range(len_s):
            # 出现一个字符不同
            if s[i] != t[i]:
                # 若长度相同则后面需要完全一样
                if len_s == len_t:
                    return s[i + 1:] == t[i + 1:]
                # 长度差为 1 时，s 从当前位置开始和 t 从下一个位置开始需相同
                else:
                    return s[i:] == t[i + 1:]
        # 前len(s)完全相同 
        return len_s + 1 == len_t

6.反转字符串中的单词Ⅱ（中等）

给你一个字符数组 s ，反转其中单词的顺序。

单词的定义为：单词是一个由非空格字符组成的序列。s 中的单词将会由单个空格分隔。

必须设计并实现原地解法来解决此问题，即不分配额外的空间。

示例 1：

1 2	输入：s = ["t","h","e"," ","s","k","y"," ","i","s"," ","b","l","u","e"] 输出：["b","l","u","e"," ","i","s"," ","s","k","y"," ","t","h","e"]

示例 2：

1 2	输入：s = ["a"] 输出：["a"]

提示：

1 <= s.length <= 105
s[i] 可以是一个英文字母（大写或小写）、数字、或是空格 ' ' 。
s 中至少存在一个单词
s 不含前导或尾随空格
题目数据保证：s 中的每个单词都由单个空格分隔

解答：

class Solution(object):
    def reverseWords(self, s):
        """
        :type s: List[str]
        :rtype: None Do not return anything, modify s in-place instead.
        """
        n = len(s)
        # 使用 Python 列表的 reverse() 方法将整个字符数组反转，这里reverse是创建了一个新的列表
        s.reverse()
        pre = 0
        for i in range(n):
            if s[i] ==' ':
                # 不可以用s[pre:i].reverse()，因为切片操作时这样会自动创建新的列表原始并没有修改
                s[pre:i] = reversed(s[pre:i])
                pre = i + 1
        # 处理最后一个单词，因为最后一个单词后面没有空格
        s[pre:] = reversed(s[pre:])
        return s

滑动窗口

1.至多包含两个不同字符的最长子串（中等）

给你一个字符串 s ，请你找出至多包含 两个不同字符 的最长子串，并返回该子串的长度。

示例 1：

1
2
3

输入：s = "eceba"
输出：3
解释：满足题目要求的子串是 "ece" ，长度为 3 。

示例 2：

1
2
3

输入：s = "ccaabbb"
输出：5
解释：满足题目要求的子串是 "aabbb" ，长度为 5 。

提示：

1 <= s.length <= 105
s 由英文字母组成

解答：

# 模板 (伪代码)
class Solution:
    def problemName(self, s: str) -> int:
        # Step 1: 定义需要维护的变量们 (对于滑动窗口类题目，这些变量通常是最小长度，最大长度，或者哈希表)
        x, y = ..., ...

        # Step 2: 定义窗口的首尾端 (start, end)， 然后滑动窗口
        start = 0
        for end in range(len(s)):
            # Step 3: 更新需要维护的变量, 有的变量需要一个if语句来维护 (比如最大最小长度)
            x = new_x
            if condition:
                y = new_y

            '''
            ------------- 下面是两种情况，读者请根据题意二选1 -------------
            '''
            # Step 4 - 情况1
            # 如果题目的窗口长度固定：用一个if语句判断一下当前窗口长度是否超过限定长度 
            # 如果超过了，窗口左指针前移一个单位保证窗口长度固定, 在那之前, 先更新Step 1定义的(部分或所有)维护变量 
            if 窗口长度大于限定值:
                # 更新 (部分或所有) 维护变量 
                # 窗口左指针前移一个单位保证窗口长度固定

            # Step 4 - 情况2
            # 如果题目的窗口长度可变: 这个时候一般涉及到窗口是否合法的问题
            # 如果当前窗口不合法时, 用一个while去不断移动窗口左指针, 从而剔除非法元素直到窗口再次合法
            # 在左指针移动之前更新Step 1定义的(部分或所有)维护变量 
            while 不合法:
                # 更新 (部分或所有) 维护变量 
                # 不断移动窗口左指针直到窗口再次合法

        # Step 5: 返回答案
        return ...

def lengthOfLongestSubstringTwoDistinct(s):
    # 1. 定义需要维护的变量
    # 最大长度max_len，设计计算不重复元素格式需要哈希表hashmap
    max_len, hashmap = 0, {}
    # 2. 定义窗口首尾（start，end），滑动窗口
    start = 0
    for end in range(len(s)):
        # 3. 更新维护变量（max_len,hashmap）
        # 当前元素计数加一，一旦哈希表长度≤2，尝试更新最大长度
        hashmap[s[end]] = hashmap.get(s[end],0) + 1
        if len(hashmap) <=2:
            max_len = max(max_len,end-start+1)
        # 4. 因为窗口长度可变: 这个时候一般涉及到窗口是否合法的问题
        # 这时要用一个while去不断移动窗口左指针, 从而剔除非法元素直到窗口再次合法
        # 当哈希表长度>2时，移动左指针，直到哈希表长度≤2
        while len(hashmap) > 2:
            head = s[start]
            hashmap[head] -= 1
            if hashmap[head] == 0:
                del hashmap[head]
            start += 1
    return max_len

# 同样道理
# 无重复字符的最长子串
class Solution:
    def lengthOfLongestSubstring(self, s: str) -> int:
        # Step 1: 定义需要维护的变量, 本题求最大长度，所以需要定义max_len, 该题又涉及去重，因此还需要一个哈希表
        max_len, hashmap = 0, {}

        # Step 2: 定义窗口的首尾端 (start, end)， 然后滑动窗口
        start = 0
        for end in range(len(s)):
            # Step 3
            # 更新需要维护的变量 (max_len, hashmap)
            # i.e. 把窗口末端元素加入哈希表，使其频率加1，并且更新最大长度
            hashmap[s[end]] = hashmap.get(s[end], 0) + 1
            if len(hashmap) == end - start + 1:
                max_len = max(max_len, end - start + 1)
            
            # Step 4: 
            # 根据题意,  题目的窗口长度可变: 这个时候一般涉及到窗口是否合法的问题
            # 这时要用一个while去不断移动窗口左指针, 从而剔除非法元素直到窗口再次合法
            # 当窗口长度大于哈希表长度时候 (说明存在重复元素)，窗口不合法
            # 所以需要不断移动窗口左指针直到窗口再次合法, 同时提前更新需要维护的变量 (hashmap)
            while end - start + 1 > len(hashmap):
                head = s[start]
                hashmap[head] -= 1
                if hashmap[head] == 0:
                    del hashmap[head]
                start += 1
        # Step 5: 返回答案 (最大长度)
        return max_len
#长度最小的子数组
class Solution:
    def minSubArrayLen(self, target: int, nums: List[int]) -> int:
        # Step 1: 定义需要维护的变量, 本题求最小长度，所以需要定义min_len, 本题又涉及求和，因此还需要一个sum变量
        min_len, sum_ = math.inf, 0

        # Step 2: 定义窗口的首尾端 (start, end)， 然后滑动窗口
        start = 0
        for end in range(len(nums)):
            # Step 3: 更新需要维护的变量 (min_len, sum_)
            sum_ += nums[end]
            if sum_ >= target:
                min_len = min(min_len, end - start + 1)

            # Step 4
            # 根据题意,  题目的窗口长度可变: 这个时候一般涉及到窗口是否合法的问题
            # 这时要用一个while去不断移动窗口左指针, 从而剔除非法元素直到窗口再次合法
            # 当sum_的值大于target时候，窗口不合法
            # 所以需要不断移动窗口左指针直到窗口再次合法, 同时提前更新需要维护的变量 (min_len， sum_)
            while sum_ >= target:
                min_len = min(min_len, end - start + 1)
                sum_ -= nums[start]
                start += 1
        # Step 5：返回答案 (最小长度)
        if min_len == math.inf:
            return 0
        return min_len

2.最大连续1的个数（中等）

给定一个二进制数组 nums ，如果最多可以翻转一个 0 ，则返回数组中连续 1 的最大个数。

示例 1：

输入：nums = [1,0,1,1,0]
输出：4
解释：翻转第一个 0 可以得到最长的连续 1。
     当翻转以后，最大连续 1 的个数为 4。

示例 2:

1 2	输入：nums = [1,0,1,1,0,1] 输出：4

提示:

1 <= nums.length <= 105
nums[i] 不是 0 就是 1.

进阶：如果输入的数字是作为 无限流 逐个输入如何处理？换句话说，内存不能存储下所有从流中输入的数字。您可以有效地解决吗？

解决：

class Solution(object):
    def findMaxConsecutiveOnes(self, nums):
        """
        :type nums: List[int]
        :rtype: int
        """
        start = 0
        zero_count = 0
        max_length = 0
        # 1 1 1 1 0 1 1 0 
        for end in range(len(nums)):
            if nums[end] == 0:
                zero_count += 1
            while zero_count > 1:
                if nums[start] == 0:
                    zero_count -= 1
                start += 1
            max_length = max(max_length, end - start + 1)
        return max_length

AAAI_CheckList模板

发表于 2024-08-14 更新于 2024-08-16
本文字数： 4.7k 阅读时长 ≈ 9 分钟

做了一个今年AAAI25的checklist的tex代码分享，无偿分享，有问题b站接着私信我就可以。

使用方法：新建一个checklist.tex文件复制下面进去，然后在主文件参考文献后面加上就可以：

1
2
3

\newpage
\clearpage
\input{./checklist}

checklist.tex：

\section{Reproducibility Checklist}
\paragraph{This paper:}
\begin{itemize}
    \item Includes a conceptual outline and/or pseudocode description of AI methods introduced (yes/partial/no/NA) {\bf yes}
    \item Clearly delineates statements that are opinions, hypothesis, and speculation from objective facts and results (yes/no) {\bf yes}
    \item Provides well marked pedagogical references for less-familiare readers to gain background necessary to replicate the paper (yes/no) {\bf yes}
\end{itemize}

\paragraph{Does this paper make theoretical contributions? (yes/no)} {\bf yes}

If yes, please complete the list below.
\begin{itemize}
    \item All assumptions and restrictions are stated clearly and formally. (yes/partial/no) {\bf yes}
    \item All novel claims are stated formally (e.g., in theorem statements). (yes/partial/no) {\bf yes}
    \item Proofs of all novel claims are included. (yes/partial/no) {\bf yes}
    \item Proof sketches or intuitions are given for complex and/or novel results. (yes/partial/no) {\bf yes}
    \item Appropriate citations to theoretical tools used are given. (yes/partial/no) {\bf yes}
    \item All theoretical claims are demonstrated empirically to hold. (yes/partial/no/NA) {\bf yes}
    \item All experimental code used to eliminate or disprove claims is included. (yes/no/NA) {\bf NA}
\end{itemize}

\paragraph{Does this paper rely on one or more datasets? (yes/no)} {\bf yes}

If yes, please complete the list below.
\begin{itemize}
    \item A motivation is given for why the experiments are conducted on the selected datasets (yes/partial/no/NA) {\bf yes}
    \item All novel datasets introduced in this paper are included in a data appendix. (yes/partial/no/NA) {\bf yes}
    \item All novel datasets introduced in this paper will be made publicly available upon publication of the paper with a license that allows free usage for research purposes. (yes/partial/no/NA) {\bf yes}
    \item All datasets drawn from the existing literature (potentially including authors’ own previously published work) are accompanied by appropriate citations. (yes/no/NA) {\bf yes}
    \item All datasets drawn from the existing literature (potentially including authors’ own previously published work) are publicly available. (yes/partial/no/NA) {\bf yes}
    \item All datasets that are not publicly available are described in detail, with explanation why publicly available alternatives are not scientifically satisficing. (yes/partial/no/NA) {\bf yes}
\end{itemize}

\paragraph{Does this paper include computational experiments? (yes/no)} {\bf yes}

If yes, please complete the list below.
\begin{itemize}
    \item Any code required for pre-processing data is included in the appendix. (yes/partial/no) {\bf no}
    \item All source code required for conducting and analyzing the experiments is included in a code appendix. (yes/partial/no) {\bf no}
    \item All source code required for conducting and analyzing the experiments will be made publicly available upon publication of the paper with a license that allows free usage for research purposes. (yes/partial/no) {\bf yes}
    \item All source code implementing new methods have comments detailing the implementation, with references to the paper where each step comes from. (yes/partial/no) {\bf yes}
    \item If an algorithm depends on randomness, then the method used for setting seeds is described in a way sufficient to allow replication of results. (yes/partial/no/NA) {\bf yes}
    \item This paper specifies the computing infrastructure used for running experiments (hardware and software), including GPU/CPU models; amount of memory; operating system; names and versions of relevant software libraries and frameworks. (yes/partial/no) {\bf yes}
    \item This paper formally describes evaluation metrics used and explains the motivation for choosing these metrics. (yes/partial/no) {\bf yes}
    \item This paper states the number of algorithm runs used to compute each reported result. (yes/no) {\bf yes}
    \item Analysis of experiments goes beyond single-dimensional summaries of performance (e.g., average; median) to include measures of variation, confidence, or other distributional information. (yes/no) {\bf yes}
    \item The significance of any improvement or decrease in performance is judged using appropriate statistical tests (e.g., Wilcoxon signed-rank). (yes/partial/no) {\bf yes}
    \item This paper lists all final (hyper-)parameters used for each model/algorithm in the paper’s experiments. (yes/partial/no/NA) {\bf yes}
    \item This paper states the number and range of values tried per (hyper-) parameter during development of the paper, along with the criterion used for selecting the final parameter setting. (yes/partial/no/NA) {\bf yes}
\end{itemize}

每日计划（长期）

发表于 2024-01-15 更新于 2024-06-26 分类于生活安排
本文字数： 11k 阅读时长 ≈ 21 分钟

个人每日计划安排，用Github备份一下，不对外开放噢

阅读全文 »

注意力机制

发表于 2024-01-02 分类于课内学习，传统算法学习，李沐《动手学深度学习》
本文字数： 17k 阅读时长 ≈ 31 分钟

1.目标

学习注意力机制的基本理论思想，代码实现底层逻辑。同时学习多头注意力机制、自注意力和位置掩码、Transformer。

阅读全文 »

循环神经网络（RNN）及现代RNN

发表于 2023-12-20 更新于 2023-12-27 分类于课内学习，传统算法学习，李沐《动手学深度学习》
本文字数： 17k 阅读时长 ≈ 30 分钟

1.目标

学习RNN的基本结构，并以此延伸出来的现代循环神经网络：GRU、LSTM、深度循环神经网络、双向循环神经网络、编码器-解码器结构、序列到序列学习（seq2seq）、束搜索。

通过自己大致复现一遍经典循环神经网络结构，熟悉代码的架构和模板。这里基础知识尽可能精简，多写一些模型和不同方法适用的场景。

阅读全文 »

卷积神经网络

发表于 2023-12-18 更新于 2023-12-19 分类于课内学习，传统算法学习，李沐《动手学深度学习》
本文字数： 19k 阅读时长 ≈ 35 分钟

1.Lenet和现代经典卷积神经网络

手敲一遍过去三四十年的传统神经网络，加深一下对于网络演变的认识，给自己后续学习新进的网络结构打好基础。

主要是分为：

传统卷积神经网络：LeNet
现代卷积神经网络：AlexNet、VGG、NiN、GoogleLeNet、Resnet、DenseNet

阅读全文 »

多层感知机

发表于 2023-12-17 分类于课内学习，传统算法学习，李沐《动手学深度学习》
本文字数： 2.1k 阅读时长 ≈ 4 分钟

1.目标

简单实现多层感知机，并学习其他方法：模型选择、欠拟合过拟合、权重衰减(L1、L2正则化)、暂退法（Dropout）、前向传播和反向传播、数值稳定性和模型初始化、环境和分布偏移。

阅读全文 »

Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization

发表于 2023-12-13 分类于课内学习，论文精读
本文字数： 506 阅读时长 ≈ 1 分钟

1.Abstract

我们要解决的问题是，在每个动作实例只有一个帧标签的情况下，如何定位动作的时间间隔，以进行训练。由于标签稀疏，现有工作无法学习动作的完整性，从而导致零碎的动作预测。在本文中，我们提出了一个新颖的框架，即生成密集的伪标签，为模型提供完整性指导。具体来说，我们首先选择伪背景点来补充点级动作标签。然后，通过将点作为种子，我们搜索可能包含完整动作实例的最佳序列，同时与种子达成一致。为了从获得的序列中学习完整性，我们引入了两种新的损失，分别从动作得分和特征相似性方面对动作实例和背景实例进行对比。实验结果表明，我们的完整性指导确实有助于模型定位完整的动作实例，从而大幅提高了性能，尤其是在高 IoU 阈值下。此外，我们还在四个基准测试中证明了我们的方法优于现有的先进方法： THUMOS'14、GTEA、BEOID 和 ActivityNet。值得注意的是，我们的方法甚至可以与最新的全监督方法相媲美，而注释成本却低 6 倍。

阅读全文 »

2023-12-10：（YOLO）Car detection for Autonomous Driving

发表于 2023-12-10 分类于课内学习，传统算法学习，吴恩达课程学习， Convolutional Neural Networks
本文字数： 9.5k 阅读时长 ≈ 17 分钟

1.任务

构建YOLO算法的各个模块：Bounding Box预测、交并比（IoU）、非极大值抑制、Anchor Boxes。

然后利用构建的YOLO算法进行目标识别（汽车检测）。

阅读全文 »

Proposal-based-Multiple-Instance-Learning-for-Weakly-supervised-Temporal-Action-Localization（基于提案的弱监督时序动作定位多示例学习）

发表于 2023-12-07 更新于 2023-12-09 分类于课内学习，论文精读
本文字数： 1.8k 阅读时长 ≈ 3 分钟

1.Abstract

弱监督时态动作定位的目的是在训练过程中仅使用视频级类别标签来定位和识别未剪辑视频中的动作。在没有实例级注释的情况下，大多数现有方法都遵循基于片段的多实例学习（S-MIL）框架，即通过视频标签对片段预测进行监督。然而，在训练过程中获取分段级分数的目标与在测试过程中获取建议级分数的目标并不一致，从而导致了次优结果。为了解决这个问题，我们提出了一种新颖的基于提案的多实例学习（Proposal-based Multiple Instance Learning，P-MIL）框架，在训练和测试阶段直接对候选提案进行分类，其中包括三个关键设计：

周边对比特征提取模块，通过考虑周边对比信息来抑制辨别性短提案；
提案完整性评估模块，通过完整性伪标签的指导来抑制低质量提案；
实例级等级一致性损失，通过利用 RGB 和 FLOW 模式的互补性来实现鲁棒检测；

在两个具有挑战性的基准（包括 THUMOS14 和 ActivityNet）上取得的大量实验结果证明了我们的方法具有卓越的性能。

阅读全文 »

SST: Single-Stream Temporal Action Proposals（单流时间动作建议）

发表于 2023-12-06 更新于 2023-12-08 分类于课内学习，论文精读
本文字数： 3.8k 阅读时长 ≈ 7 分钟

1.Abstract

本文提出了一种新的方法，用于长时间未修剪视频序列中人体动作的时间检测。我们引入单流时序动作提案( SST )，这是一种新的有效且高效的生成时序动作提案的深度架构。我们的网络可以在很长的输入视频序列上以单个流连续运行，而不需要将输入分割成短的重叠片段或时间窗口进行批处理。我们在实验中证明了我们的模型在时间动作提议生成任务上的表现优于当前最先进的模型，同时达到了一些文献中最快的处理速度。最后，我们证明了将SST建议与现有的动作分类器结合使用，可以提高最先进的速度。

本论文主要提出了一个新进的Proposals方法，用于目标检测和动作识别。

阅读全文 »

2023-12-5：Residual-Networks（ResNet）

发表于 2023-12-04 更新于 2023-12-17 分类于课内学习，传统算法学习，吴恩达课程学习， Convolutional Neural Networks
本文字数： 11k 阅读时长 ≈ 21 分钟

1.任务

先一个个实现ResNets的网络模块，然后将这些模块拼到一起构建ResNets来实现图像分类。

阅读全文 »

野生调参侠的博客

leetcode刷题日记

面试精选100题

数组、字符串

1.数组列表中的最大距离（中等）

2.摆动排序（中等）

3.反转数字（简单）

4.字符串左右移

5.相隔为1的编辑距离（中等）

6.反转字符串中的单词Ⅱ（中等）

滑动窗口

1.至多包含两个不同字符的最长子串（中等）

2.最大连续1的个数（中等）

AAAI_CheckList模板

每日计划（长期）

注意力机制

1.目标

循环神经网络（RNN）及现代RNN

1.目标

卷积神经网络

1.Lenet和现代经典卷积神经网络

多层感知机

1.目标

Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization

1.Abstract

2023-12-10：（YOLO）Car detection for Autonomous Driving

1.任务

Proposal-based-Multiple-Instance-Learning-for-Weakly-supervised-Temporal-Action-Localization（基于提案的弱监督时序动作定位多示例学习）

1.Abstract

SST: Single-Stream Temporal Action Proposals（单流时间动作建议）

1.Abstract

2023-12-5：Residual-Networks（ResNet）

1.任务