simplify the KJT.split function when segment is the original KJT #3014

TroyGarden · 2025-05-29T04:40:27Z

Summary:

context

In the KJT.split function, when segment == len(keys), the returned KJT contains the same data as the original KJT.
However, the function currently constructs a new KJT in that case, which introduces extra cost.
This diff removes the redundant KJT creation.

analysis

When segment == len(keys), start has to be zero, so stride_per_key_per_rank is the original one.
The following KJT init therefore produces the same KJT as self:

KeyedJaggedTensor(
    keys=self._keys,
    values=self._values,
    weights=self.weights_or_none(),
    lengths=self._lengths,
    offsets=self._offsets,
    stride=self._stride,
    stride_per_key_per_rank=stride_per_key_per_rank,
    stride_per_key=None,
    length_per_key=self._length_per_key,
    lengths_offset_per_key=None,
    offset_per_key=self._offset_per_key,
    index_per_key=self._index_per_key,
    jt_dict=self._jt_dict,
    inverse_indices=None,
)
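The idea can be illustrated with a minimal sketch. It does not use torchrec: `ToyKJT` is a hypothetical stand-in that stores plain Python lists instead of tensors, and its `split` only mimics the shape of the real method. The point is the early branch that reuses `self` when a segment covers every key.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ToyKJT:
    """Hypothetical stand-in for KeyedJaggedTensor (lists, not tensors)."""

    keys: List[str]
    values: List[int]
    length_per_key: List[int]  # number of values that belong to each key

    def split(self, segments: List[int]) -> List["ToyKJT"]:
        out: List["ToyKJT"] = []
        start_key = 0  # index of the first key of the current segment
        start_val = 0  # index of the first value of the current segment
        for segment in segments:
            end_key = start_key + segment
            n_vals = sum(self.length_per_key[start_key:end_key])
            if segment == len(self.keys):
                # The segment spans every key, so start_key is 0 and the
                # result would hold exactly the data of self: reuse it.
                out.append(self)
            else:
                # General case: build a new object from the sliced data.
                out.append(
                    ToyKJT(
                        keys=self.keys[start_key:end_key],
                        values=self.values[start_val:start_val + n_vals],
                        length_per_key=self.length_per_key[start_key:end_key],
                    )
                )
            start_key = end_key
            start_val += n_vals
        return out


kjt = ToyKJT(keys=["a", "b"], values=[1, 2, 3], length_per_key=[1, 2])
whole = kjt.split([2])     # one segment covering all keys
parts = kjt.split([1, 1])  # two segments of one key each
```

In this sketch `whole[0]` is the same object as `kjt`, so callers share state with the original rather than receiving a copy. That is the trade-off of skipping the reconstruction.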

Differential Revision: D70756397


facebook-github-bot · 2025-05-29T04:40:38Z

This pull request was exported from Phabricator. Differential Revision: D70756397


…orch#3014)

Reviewed By: iamzainhuda

Differential Revision: D70756397

facebook-github-bot added the CLA Signed label May 29, 2025

facebook-github-bot added the fb-exported label May 29, 2025

facebook-github-bot closed this in 22078ab May 29, 2025
